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RAPID IMMUNOS ELECTION CLONING METHOD 

Background 

A basic tool in the field of recombinant genetics is 
the conversion of poly (A)" 1 " mRNA to double-stranded (ds) 
0 5 cDNA, which then can be inserted into a cloning vector 
and expressed in an appropriate host cell. Molecular 
cloning methods for ds cDNA have been reviewed, for 
example, by Williams, "The Preparation and Screening of a 
cDNA Clone Bank," in Williamson, ed., Genetic Engineer- 
10 ing , Vol. 1, p. 2, Academic Press, New York (1981); 
Maniatis, "Recombinant DNA" , in Prescott, ed. , Cell 
Biology / Academic Press, New York (198 0) ; and 
Efstratiadis et al . , "Cloning of Double-Stranded DNA, 11 in 
Stelo et ah , Genetic Engineering , Vol. 1, p. 15, Plenum 
15 Press, New York (1979). 

A substantial number of variables affect the suc- 
cessful cloning of a particular gene and cDNA cloning 
strategy thus must be chosen with care, A method common 
to many cDNA cloning strategies involves the construction 
2 0 of a "cDNA library" which is a collection of cDNA clones 
derived from the total poly (A) 4 " mRNA derived from a cell 
of the organism of interest. 

A mammalian cell may contain up to 30,000 different 
irtRNA sequences, and the number of clones required to 
25 obtain low-abundance mRNAs, for example, may be much 

greater. Methods of constructing genomic eukaryotic DNA 
libraries in different expression vectors, including 
bacteriophage A , cosmids, and viral vectors, are known. 
Some commonly used methods are described, for example, in 
30 Maniatis et al . , Molecular Cloning: A Laboratory Manual , 
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Cold Spring Harbor Laboratory, publisher. Cold Spring 
Harbor, New York: (1982) . 

Once a genomic cDNA library has been constructed, it 
is necessary to isolate from the thousands of- host cells 

05 the cell containing the particular human gene of 

interest. Many different methods of isolating target 
genes from cDNA libraries have been utilized, with 
varying success. These include, for example, the use of 
nucleic acid probes, which are labeled mRNA fragments 

10 having nucleic acid sequences complementary to the DNA 

sequence of the target gene. When this method is applied 
to cDNA clones of abundant mRNAs in transformed bacterial 
hosts, colonies hybridizing strongly to the probe are 
likely to contain the target DNA sequences. The identity 

15 of the clone then may be proven, for example, by in situ 
hybridization/selection (Goldberg et ah , Methods 
Enzymol . , 68:206 (1979)) hybrid-arrested translation 
(Paterson et al « , Proceedings of the National Academy of 
Sciences, 74:4370 (1977)), or direct DNA sequencing 

20 (Maxam and Gilberts Proceedings of the National Academy 

of Sciences , 74:560 (1977); Maat and Smith, Nucleic Acids 
Res. , 5:4537 (1978) ) „ 

Such methods, however, have major drawbacks when the 
object is to clone mRNAs of relatively low abundance from 

2 5 cDNA libraries. For example, using direct in situ colony 
hybridization, it is very difficult to detect clones 
containing cDNA complementary to mRNA species present in 
the initial library population at less than one part in 
200. As a result, various methods for enriching mRNA in 

30 the total population (e.g. size fractionation, use of 
synthetic oligodeoxynucleotides , differential 
hybridization, or immunopurif ication) have been developed 
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and are often used when low abundance mRNAs are cloned. 
Such methods are described, for example, in Maniatis et 
al . , Molecular Cloning: A Laboratory Manual , supra . 

Many functional eukaryotic proteins initially exist 
0 5 in the form of precursor molecules which contain leader 
or signal sequences at their N-terminal ends. These 
leader sequences bind to the cell membrane and draw the 
remainder of the protein through the lipid bilayer, after 
which the signal sequence is cleaved from the protein by 
10 a signal peptidase enzyme. The protein thus functions 
only after secretion from the cells (for example, 
insulin , serum albumin, antibodies, and digestive tract 
enzymes) , or after the proteins have been anchored to the 
outer surface of a cell membrane (for example, histocom- 
15 patibility antigens) • 

The cell surface antigens characteristic of 
mammalian T lymphocytes are additional examples of 
proteins that anchor to the cell surface. In mammals, 
certain cells derived from bone marrow mature into 
20 lymphocytes, which are present in the lymphoid organs, 
including the thymus, spleen, lymph nodes, and lymphoid 
aggregates, and also circulate actively through the blood 
and lymph systems. Mature lymphocyte cells may be 
divided into two populations: thymus -dependent (T) 

2 5 lymphocytes and thymus-independent (B) lymphocytes. T 

lymphocytes migrate to the interior of the thymus, where 
they undergo dif f erentiative proliferation. During their 
differentiation process, they express characteristic cell 
surface membrane alloantigens , including Thy-1, TLA, 

3 0 gv-i, Ly-l, Ly-2, Ly-3 , and Ly-5 . As they mature, T 

lymphocytes lose the TLA antigens and some of the Thy-1 
antigens, and gain histocompatibility antigens, acquiring 
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the membrane conformation typical of the recirculating T 
lymphocytes. This is described, for example, by Mota, 
"Activity of Immune Cells, " in Bier et al . , eds., 
Fundamentals of Immunology , 2d Ed, , Springer-Verlag, 

05 Berlin, pp. 35-62 (1986) - 

T lymphocytes are involved indirectly in the 
formation of antibodies and their activities thus have 
required complex analysis of cell function, rather than 
simple antibody titer measurement • Partly due to this, 

10 their importance in development of immunologic competence 
was not recognized until relatively recently. Mature T 
lymphocytes synthesize and express an unique pattern of 
surface glycoprotein antigens which serve as markers for 
identification of different T lymphocyte subpopulations, 

15 including T helper cells, T suppressor cells, and T 

cytotoxic cells. Each of these subpopulations plays a 
very important role in regulating the immune system. 
(Mota, supra ) „ 

In humans, the functional and phenotypic heterogene- 

20 ity of T lymphocytes is well accepted „ Two major subpop- 
ulations are known: effector T cells mediating cellular 
immunity; and regulator T cells containing helper and 
suppressor T lymphocytes. These two subpopulations have 
been defined with heteroantisera , autoantibodies, and 

25 monoclonal antibodies directed at celi surface antigens. 

For example, earlier in their development, human lymphoid 
cells in the thymus express an antigen designated Til 
which reacts strongly to a monoclonal antibody designated 
Cluster of Differentiation 2 (CD2), and react slightly 

3 0 with monoclonal antibody CDS to cell surface antigen Tl. 
During maturation, these cells lose Til (CD2) and acquire 
three new antigens defined by monoclonal antibodies CD4 , 
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CD8, and CD1. With further maturation, the thymocytes 
cease to express cell surface antigens reactive with 
monoclonal antibody GDI, express the T3 antigen reactive 
with monoclonal antibody CD3 , and then segregate into two 
0 5 subpopulations which express either T4 (CD4) or T8 (CD8) 
antigen. Immunologic competence is acquired at this 
stage, but is not completely developed until thymic 
lymphocytes migrate outside the thymus. (Mota, supra.) 
In contrast with the majority of thymocytes, circulating 
10 T lymphocytes express the Tl (CDS) and T3 (CD3) antigens. 
The T4 (CD4) antigen is present on approximately 55-65% 
of peripheral T lymphocytes, whereas the T8 (CD8) antigen 
is expressed on 2 0-3 0%. These two subpopulations corre- 
spond to helper and to suppressor and cytotoxic T cells, 
15 respectively. 

In addition to providing a convenient means of 
distinguishing T lymphocyte subpopulations, these cell 
surface antigens are important for mature T cell ac- 
tivation and effector function. T cell activation 
20 involves a complex series of cell surface interactions 

between the T cell and the target cell or stimulator cell 
in addition to binding of the T cell receptor to its 
specific antigen . 

For example, CD2 , the human T cell erythrocyte 
25 receptor, allows thymocytes and T-lymphocytes to adhere 
to target cells (e.g., erythrocytes) and to thymic 
epithelium. This occurs via a specific molecular ligand 
for CD2, designated LFA-3 , in humans, which is a widely 
distributed surface antigen. This phenomenon has long 
30 been employed to detect, assay and purify human cells 

producing antibodies to sheep erythrocytes and serves as 
the basis for the E-rosette test, first described by 
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Zaalberg, Nature , 202 :1231 (1964). CD2/LFA-3 
interactions also have been shown to mediate cytolytic 
target conjugation (Shaw et al. f Nature, 323 : 262-264 
(198 6) , and the mixed lymphocyte reaction (Martin et al . , 

05 J. Immunol o , 131 :180-185 (1983), Anti-CD2 monoclonal 

antibodies can directly activate peripheral T-lymphocytes 
via an antigen-independent pathway (Meuer et al« , Cell, 
3_6:897-906 (1984), indicating an even wider 
immunoregulatory role for CD2- 

10 Recognition that T lymphocytes are the main 

effectors of cell— mediated immunity and also are involved 
as helper or suppressor cells in modulating the immune 
response has resulted in a significant contribution to 
the increasing practical application of clinical 

15 immunology to medicine. The scope of this application 
includes defense against infections, prevention of 
diseases by immunization, organ transplantation, blood 
banking, deficiencies of the immune system, and a variety 
of disorders that are mediated by immunologic mechanisms. 

20 Moreover, immunologic techniques frequently are used in 
the clinical laboratory, as in the measurement of hor- 
mones and drugs o Clinical immunology is described, for 
example, in Weir, ed, , Handbook of Experimental 
Immunology in Four Volumes: Volume 4: Applications of 

25 Immunological Methods in Biomedical Sciences , 4th Ed. , 
Blackwell Scientific Publications, Oxford (198 6) ; 
Bogus laski et al. , eds„. Clinical Immunochemistry: 
Principles of Methods and Applications , Little, Brown & 
Co,, Boston (1984); Holborow et al . , eds.. Immunology in 

30 Medicine: A Comprehensive Guide to Clinical Immunology , 
2d Ed-, Grune & Stratton, London (1983); and Petersdorf 
et al. , eds., Harrison's Principles of Internal Medicine , 
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10th ed. , McGraw-Hill, New York, publisher, pp. 344-391 
(1983) . Clearly, a more thorough understanding of the 
proteins which mediate the immune system would be of 
significant value in clinical immunology. 
05 Use of mammalian expression libraries to isolate 

cDNAs encoding mammalian proteins such as those described 
above would offer several advantages. For example, the 
protein expressed in a mammalian host cell should be 
functional and should undergo any normal 
10 posttranslational modification. A protein ordinarily 

transported through the intracellular membrane system to 
the cell surface should undergo the complete transport 
process. A mammalian expression system also would allow 
the study of intracellular transport mechanisms and of 
15 the mechanism that insert and anchor cell surface 
proteins to membranes. 

One common mammalian host cell, called a "COS" cell, 
is formed by infecting monkey kidney cells with a mutant 
viral vector, designated simian virus strain 4 0 (SV4 0) , 
2 0 which has functional early and late genes, but lacks a 
functional origin of replication- In COS cells, any 
foreign DNA cloned on a vector containing the SV4 0 origin 
of replication will replicate because SV40 T antigen is 
present in COS cells. The foreign DNA will replicate 
25 transiently, independently of the cellular DNA. 

With the exception of some recent lymphokine cDNAs 
isolated by expression in COS cells (Wong, G.G., et al . , 
Science , 228 :810-815 (1985); Lee, F. et al. , Proceedings 
of the National Academy of Sciences, USA , 83 : 2061-2065 
30 (1986); Yokota , T., et al, , Proceedings of the National 
Academy of Sciences, USA , 83:5894-5898 (1986); Yang, Y. , 
et al. , Cell , 47:3-10 (1986)), however, few cDNAs in 
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general are isolated from mammalian expression libraries. 
There appear to be two principal reasons for this: 
First, the existing technology (Okayama, H- et al . , Mol. 
Cell. Biol., 2:161-170 (1982)) for construction of large 
plasmid libraries is difficult to master, and library 
size rarely approaches that accessible by phage cloning 
techniques. (Huynh, T. et al. . In: DNA Cloning Vol. i f a 
Practical Approach, Glover, D.M. (ed. ) , IRL Press, Oxford 
(1985), pp. 49-78). Second, the existing vectors are, 
with one exception (Wong, G.G., et al. , Science , 
228:810-815 (1985)), poorly adapted for high level 
expression, particularly in COS cells. The reported 
successes with lymphokine cDNAs do not imply a general 
fitness of the methods used, since these cDNAs are 
particularly easy to isolate from expression libraries. 
Lymphokine bioassays are very sensitive ((Wong, G.G. , et 
al^, Science, 228:810-815 (1985)? Lee, F. et al. # 
Proceeding s of th e National Academy of Sciences, USA . 
83:2061-2065 (1986); Yokota, T. et al. , Proceedings of 
the National Acade my of Sciences, USA . 83:5894-5898 
(1986),- yang, Y. et al, , Cell , 47:3-10 (1986)) and the 
mRNAs are typically both abundant and short (Wong, G.G. 
et a1 - ' §£ience, 228:810-815 (1985); Lee, F. , et al. . 
Proceedings of the National Acad emy of Sconces, usa ! 
83:2061-2065 (1986); Yokota, T. , et.al. Proceedings of 
the Nati onal Academy of Sciences, usa . 83:5894-5898 
(1986); Yang, Y. , et al. , Cell . 47:3-10 (1986)). 

Thus, expression in mammalian hosts previously has 
been most frequently employed solely as a means of 
verifying the identity of the protein encoded by a gene 
isolated by more traditional cloning methods. For 
example, Stuve et al., J. Virol. . 61 f 2 ) : 327-335 (1987), 
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cloned the gene for glycoprotein gB2 of herpes simplex 
type II strain 33 3 by plaque hybridization of M13-based 
recombinant phage vectors used to transform competent E . 
coli JM101. The identity of the protein encoded by the 
0 5 clone thus isolated was verified by transfection of 
mammalian COS and Chinese hamster ovary (CHO) cells. 
Expression was demonstrated by immunofluorescence and 
radioimmunoprecipitation. 

Oshiraa et al . used plague hybridization to screen a 
10 phage lambdagtll cDNA library for the gene encoding human, 
placental beta-glucuronidase . Oshima et al . , Proceedings 
of the National Academy of Sciences, U.S.A. , 84 : 685-689 
(1987) . The identity of isolated cDNA clones was 
verified by immunoprecipitation of the protein expressed 
15 by COS-7 cells transfected with cloned inserts using the 
SV4 0 late promoter. 

Transient expression in mammalian cells has been 
employed as a means of confirming the identity of genes 
previously isolated by other screening methods. Gerald 
20 et al . , Journal of General Virology , 67:2695-2703 (1986). 
Mackenzie, Journal of Biological Chemistry , 261 ; 14112- 
14117 (1986); Seif et al . , Gene , 4^:1111-1121 (1986); 
Orkin et al . , Molecular and Cellular Biology , 5 (4) :762- 
767 (1985) . These methods often are inefficient and 
25 tedious and require multiple rounds of screening to 
identify full-length or overlapping clones. Prior 
screening methods based upon expression of fusion 
proteins are inefficient and require large quantities of 
monoclonal antibodies. Such drawbacks are compounded by 
30 use of inefficient expression vectors, which result in 
protein expression levels that are inadequate to enable 
efficient selection. 
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Summary of the Invention 

The present invention relates to a powerful new 
method for cloning cDNA encoding cell surface antigens, 
to a method of constructing cDNA libraries to high 
05 efficiency expression vectors particularly suited for 
high level expression in eukaryotic host cells, and to 
the isolated nucleotide sequences and their encoded 
products . 

The highly efficient cloning technique of the 
present invention is based upon transient expression of 
antigen in eukaryotic cells and physical selection of 
cells expressing the antigen by adhesion to an 
antibody-coated substrate, such as a culture dish. The 
methods of the present invention are useful for the 
isolation and molecular cloning of any protein which can 
Call \^4h^ be ex P resse <* and transported to the cell surface membrane 

of a eukaryotic cell. 

The method for cloning cDNA encoding a cell surface 
antigen of the present invention comprises preparing a 
2 0 cDNA library? introducing this cDNA library into 

eukaryotic mammalian preferably tissue culture cells ; 
culturing these cells under conditions allowing 
expression of the cell surface antigen; exposing the 
cells to a first antibody or antibodies directed against 
the cell surface antigen, thereby allowing the formation 
of a cell surface antigen-first antibody complex ; 
subsequently exposing the cells to a substrate coated 
with a second antibody directed against the first 
antibody, thereby causing cells expressing the cell 
surface antigen to adhere to the substrate via the 
formation of a cell surface antigen-first antibody-second 
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antibody complex; and separating adherent from 
non-adherent cells . 

By means of the cloning method of the present 
invention, isolation and molecular cloning of- genes 

0 5 encoding such cell surface antigens as the following have 
been accomplished: the CDla, CDlb, CDlc, CD2, CD6, CD7 , 
CD13, CD14, CD16, CD19, CD20, CD22, CD26, CD27, CD28, 
CD31, CDw32a / CDw32b, CD33, CD34, CD36, CD37, CD38, CD39, 
CD4 0, CD43, I CAM, LFA-3 , FcRIa, FcRIb, TLiSa, and Lev-8 

10 antigens. The nucleotide sequences of genes cloned by 

the method of the present invention have been determined 
and the amino acid sequences of the encoded proteins have 
been identified. A cloned gene, such as that encoding 
CDla, CDlb, CDlc, CD2 , CD 6 , CD7 , CD13 , CD14 , CD16 , CD19, 

15 CD20, CD22, CD26, CD27 , CD28, CD31, CDw32a, CDw32b, CD33, 
CD34, CD3 6, CD37, CD38, CD39, CD40, CD4 3, I CAM, LFA-3, 
FcRIa, FcRIb , TLiSa, and Lev-8, is also the subject of 
the present invention. 

Once the gene encoding an antigen has been cloned 

2 0 according to the method of the present invention, that 

gene can be expressed in a prokaryotic or a eukaryotic 
host cell to produce the ^encoded protein or portion 
thereof in substantially pure form such as it does not 
exist in nature. Another aspect of the present invention 
25 relates to substantially pure cell surface antigens, 
particularly: CDla, CDlb, CDlc, CD2 , CD6 , CD7 , CD13 , 
CD14, CD16, CD19, CD20, CD22, CD26, CD27, CD28, CD31, 
CDw32a, CDw32b, CD33, CD34, CD36, CD37, CD38, CD39, CD40, 
CD4 3, I CAM, LFA-3, FcRIa, FcRIb, TLiSa, and Lev-8 

3 0 antigens and their functional analogues and equivalents. 

The primary amino acid sequences of the CDla, CDlb, CD2 , 
CD7, CD14, CD16, CD19 , CD20, CD22 , CD28, CDw32a, CDw32b, 
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CD33, CD34, CD40, 1CAM, LFA-3 , FcRIa, FcRIb and TLiSa 
antigens have been determined. The invention thus also 
relates to the amino acid sequences of those antigens and 
their functional equivalents and to the nucleotide 

05 sequences encoding those antigens • 

This invention also relates to high efficiency cDNA 
expression vectors which allow the generation of very- 
large mammalian expression libraries and yield large 
amounts of protein in mammalian host cells, resulting in 

10 efficient selection. In a particular embodiment of this 
invention, a cDNA expression vector comprises a sup- 
pressor tRNA gene; an SV40 origin; a synthetic transcrip- 
tion unit, comprising a chimeric promoter composed of 
human cytomegalovirus AD169 immediate early enhancer 

15 sequencer fused to the HIV LTR -60 to +80 sequences, 

inserted between the suppressor tRNA gene and the SV4 0 
origin; a polylinker comprising two Bst XI sites separated 
by a replaceable DNA sequence and flanked by Xba l sites; 
and an SV40 small t antigen splice and early region 

20 polyadenylation signals o 

A further aspect of the present invention comprises 
a synthetic transcription unit for use in a cDNA expres- 
sion vector, comprising a chimeric promoter composed of 
human cytomegalovirus AD169 immediate early enhancer 

25 sequences fused to HIV LTR -60 to +80 sequences. The 

small size and particular arrangement of the sequences of 
the cDNA expression vector of the present invention allow 
highly efficient replication in host mammalian tissue 
culture cells, such as COS cells. Moreover, this vector 

3 0 employs a polylinker containing two inverted Bst XI sites 
separated by a short replaceable DNA segment , which 
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allows the use of very efficient oligonucleotide-based 
cDNA insertion strategy . 

In another aspect , the present invention comprises a 
vector comprising two identical BstXI sites in inverted 
0 5 orientation each with respect to the other, which Bst XI 
sites are separated by a short replaceable DNA fragment. 
Another aspect of the invention is a polylinker as 
described above . 

A further aspect of the invention relates to an 
10 oligonucleotide-based cDNA insertion method, comprising 
ligating synthetic DNA oligonucleotides to the cDNA 
segment desired to be inserted into a vector, the syn- 
thetic DNA oligonucleotides giving the same terminal 
sequences as those of the short replaceable DNA fragment 
15 of the polylinker of the invention, and inserting the 

resulting cDNA segment plus synthetic DNA oligonucleotide 
terminal sequences into the polylinker of the vector, 
from which the short replaceable DNA fragment previously 
has been removed „ 
2 0 In preparing cDNA libraries according to the present 

invention, it has been discovered that many tumors are 
heavily infiltrated by macrophages and lymphocytes, and 
thus may be employed as a source of macrophage or lympho- 
cyte transcripts to good effect, instead of tumor cell 
2 5 lines commonly used. In another aspect, then, the 

present invention relates to the use of tumor cells, 
particularly human tumor cells, to prepare cDNA libraries 
for use according to the methods of the present inven- 
tion, 

30 Another advantage of the powerful selection system 

of the present invention is that directional insertion of 
the cDNA is not necessary. The method of the present 
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invention results in library construction efficiencies 
which are on a par with those described for phage vectors 
such as A gt 10 and A gtll, with the additional advantage 
that clones generated according to the methods of the 
present invention are easier to manipulate. 

The immunoselection technique of the present inven- 
tion allows efficient use of antibodies, which may be 
monoclonal or polyclonal, in relatively small absolute 
amounts. The method of the present invention also is 
10 quite rapid. Generally, three or fewer cycles of immuno- 
selection and rescue are required to isolate a target 
cDNA clone. Thus, the method of the present invention 
also results in the efficient use of labor and materials 
when cloning genes encoding cell surface antigens. As 
described above, this method has been employed to 
successfully clone genes encoding cell surface antigens 
associated with mammalian T lymphocytes (e.g. antigens 
CDla, CDlb, CDlc, CD2, CD6, CD7 , CD13 , CD14, CD16, CD19 
CD20, CD22, CD26, CD27, CD28, CD31, CDw32a, CDw32b, CD33, 
CD34, CD36, CD37, CD38, CD39, CD40, CD43, ICAM, LFA-3 , 
FcRIa, FcRIb, TLiSa, and Lev-8) . 

The purified genes and proteins of the present 
invention are useful for immunodiagnostic and immuno- 
therapeutic applications, including the diagnosis and 
treatment of immune-mediated infections, diseases, and 
disorders in animals, including humans. They can also be 
used to identify, isolate and purify other antibodies and 
antxgens. Such diagnostic and therapeutic uses comprise 
yet another aspect of the present invention. Moreover 
the substantially pure proteins of the present invention 
may be prepared as medicaments or pharmaceutical 
compositions for therapeutic administration. The present 
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invention further relates to such medicaments and 
compositions. 

Brief Description of the Drawings 

Figure 1. Nucleotide sequence of expression vector piH3 
05 Nucleotides 1-58 9 are derived from pMBl origin 

(pBR322 ori) ; nucleotides 590-597 are derived from the 
SacII linker (ACCGCGT) ; nucleotides 598-799 are derived 
from the synthetic tyrosine suppressor tRNA gene (supF 
gene) ; nucleotides 800-947 are derived from a remnant of 
10 the ASV LTR fragment (PvuII to MIul) ; nucleotides 

948-1500 are derived from the human cytomegalovirus AD169 
enhancer; nucleotides 1501-1650 are derived from HIV TATA 
and tat-responsive elements; nucleotides 1651-1716 are 
derived from the piLNXAN polylinker ( Hin dlll to Xba ) ; 
15 nucleotides 1717-2569 are derived from pSV to splice and 
poly-Addition signals; nucleotides 2570-2917 are derived 
from the SV4 0 origin of replication ( pvu ll to ( Hin dlll) ; 
and nucleotides 2918-2922 are derived from piVX, remnant 
of ri site from polylinker. 

20 Figure 2. Nucleotide seguence of the CD2cDNA insert 
Nucleotide numbering is given in parentheses at 
right, amino acid numbering, center and left. Locations 
of the potential sites for addition of asparagine-linked 
carbohydrate (CHO) are shown, as well as the predicted 

25 transmembrane (TM) sequence. The amino acid sequence is 
numbered from the projected cleavage site of the 
secretory signal sequence. Proline residues are 
underscored with asterisks. The presumed polyadenylation 
signal for the 1 . 65kb . transcript , and a possible 
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polyadenylation signal for the 1.3kb transcript, are 
shown boxed. 

Figure 3 . Restriction map of the CDM8 expression vector 
The CDM8 vector includes a deleted version of a 
05 mutant polyoma virus early region selected for high 

efficiency expression in both murine and monkey cells. 

Substantially all of the human immunodeficiency promoter 

region has been replaced with the cognate sequences of 

the human cytomegalovirus immediate early promoter, and 
10 by inclusion of a bacteriophage T7 promoter between the 

eukaryotic promoter and the site of cDNA insertion. 

Arrows indicate the direction of transcription. 

Figure 4 . Nucleotide sequence and corresponding amino 
acid sequence of the LFA-3 antigen 

15 WOP cells transfected with a clone encoding the 

LFA-3 antigen were detected by indirect immuno- 
fluorescence , amplified and sequenced. (A) shows the 874 
base pair insert containing- an open reading frame of 237 
residues originating at a methionine codon, and 

20 terminating in a series of hydrophobic residues. 

Hydrophobic and hydrophilic regions within this open 
reading frame are shown in (B) . 

Figure 5. Restriction Map of the piH3M vector 

The direction of transcription is indicated by an 
25 arrow. Restriction endonuclease sites flanking the BstXI 
cloning sites are shown. 
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Figure 6, Nucleotide sequence of the piH3M vector 

There are 7 segments- Residues 1-587 are from the 
pBR322 origin of replication, 588-1182 from the M13 
origin, 1183-1384 from the supF gene, 1385-2238 are from 
0 5 the chimeric cytomegalovirus/human immunodeficiency virus 
promoter, 2239-2 647 are from the replaceable fragment, 
2 648-3547 from plasmid pSV2 (splice and polyadenylation 
signals) , and 3548-3900 from the SV40 virus origin. 

Figure 7. Nucleotide sequence of the CD28 cDNA 

Nucleotide numbering is given in parentheses at 
right, amino acid numbering, center and left. Location 
of the potential sites for addition of asparagine-linked 
carbohydrate (CHO) are shown, as well as the predicted 
transmembrane (TM) sequence. The amino acid sequence is 
numbered from the projected cleavage site of the 
secretory signal sequence. The presumed polyadenylation 
signal is shown boxed. 

Figure 8, Nucleotide sequence of the .CD7 cDNA insert 

Nucleotide numbering is given in parentheses at 
right. The amino acid sequence is numbered from the 
projected cleavage site of the secretory signal sequence. 
Splice donor and acceptor sites indicated by (/) . The 
location of the potential sites for addition of 
asparagine-linked carbohydrate (CHO) are shown, the 
25 potential fatty acid esterif ication site is denoted (*) , 
and the predicted transmembrane domain (TM) is 
underlined. Nucleotide sequences potentially involved in 
hairpin formation are denoted by ( . ) . The presumed 
polyadenylation signal is underlined. 
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Figure 9. Nucleotide sequence of the CDw3 2 cDNA 

Nucleotide number is given in the parenthesis at 
right, amino acid numbering, center and left. Locations 
of the potential sites for addition of asparagine-1 inked 
05 carbohydrate (CHO) are shown, as well as the predicted 

transmembrane (TM) sequence. The amino acid sequence is 
numbered from the projected cleavage site of the 
secretory signal sequence- Cysteine residues are 
underscored with asterisks. 

10 Figure 10 0 Secruence of the CD20.4 cDNA 

A. The sites of potential N-linked glycosylation 

are denoted by the symbol -CHO-: the hydrophobic ■ regions 

are underscored. The site of the poly (A) ~*~ tail in clone 

CD20„6 is denoted by an asterisk* 
15 B« Hydrophobicity profile of the amino acid 

sequence in A. 

Figure 11- Secruence of JCAM-1 

Complete nucleotide sequence of ICAM-1 cDNA insert 
and predicted protein sequence. Nucleotide numbering is 

20 at right, amino acid numbering in parentheses at left. 
The RGE motif at position 152 is underlined, the 
potential N-linked glycosylation sites are indicated by 
-CHO- and the transmembrane domain by — TM— . The amino 
acid sequence is numbered from the projected cleavage 

25 site of the signal peptide. Sequencing was by 

dideoxy-chain termination (Sanger, F. , et al. , Proc. 
Natl. Acad, Sci. USA , 74:5463-5467 (1977)), using a 
combination of subclones, and specific oligonucleotides. 



Figure 12. Nucleotide sequence of CD19 
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Figure 13. Nucleotide sequence of CD2 0 

Figure 14. Nucleotide sequence of CD22 

Figure 15. Nucleotide sequence of CDw32a 

Figure 16. Nucleotide sequence of CDw32b 

05 Figure 17. Nucleotide sequence of CD4 0 

Detailed Description of the Invention 

This invention relates to a novel method for cloning 
cDNA encoding a cell surface antigen and to a method of 
constructing cDNA libraries. It also relates to 
10 particular cDNA expression vectors and components 

thereof, nucleotide sequences or genes isolated by the 
method, substantially pure cell surface antigens encoded 
by the cDNA segments, and methods of using the isolated 
nucleotide sequences and encoded products. 
15 In the following description, reference will be made 

to various methodologies known to those of skill in the 
art of recombinant genetics . Publications and other 
materials setting forth such known methodologies to which 
reference is made are incorporated herein by reference in 
20 their entireties. 

Standard reference works setting forth the general 
principles of recombinant DNA technology include Darnell, 
J.E. et al . , Molecular Cell Biology , Scientific American 
Books, Inc., publisher, New York, N.Y. (1986); Lewin, 
- 5 B.M., Genes II , John Wiley & Sons, publisher, New York, 
N.Y. (1985); Old, R.W. et al. , Principles of Gene 
Manipulation: An Introduction to Genetic Engineering , 2d 
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edition, University of California Press , Berkeley, CA 
(1981) ; and Maniatis, T. et al. # Molecular Cloning-: A 
Laboratory Manual , Cold Spring Harbor, NY (1982) • 

By "cloning" is meant the use of in vitro recom- 

05 bination techniques to insert a particular gene or other 
DNA sequence into a vector molecule. In order to suc- 
cessfully clone a desired gene, it is necessary to employ 
methods for generating DNA fragments, for joining the 
fragments to vector molecules, for introducing the 

10 composite DNA molecule into a host cell in which it can 
replicate, and for selecting the clone having the target 
gene from amongst the recipient host cells. 

By "cDNA" is meant complementary or copy DNA 
produced from an RNA template by the action of 

15 RNA-dependent DNA polymerase (reverse transcriptase) * 

Thus a "cDNA clone" means a duplex DNA sequence comple- 
mentary to an RNA molecule of interest, carried in a 
cloning vector- 

By "cDNA library" is meant a collection of 

20 recombinant DNA molecules containing cDNA inserts which 

together comprise the entire genome of an organism- Such 
a cDNA library may be prepared by art-recognized methods 
described, for example, in Maniatis et al . , Molecular 
Clonings A Laboratory Manual , supra - Generally, RNA is 

25 first isolated from the cells of an organism from whose 
genome it is desired to clone a particular gene. 
Preferred for the purposes of the present invention are 
mammalian, and particularly human, cell lines. More 
preferred are the human tumor cell line HP B- ALL and the 

3 0 human lymphoblastoid cell line JY. Alternatively, RNA 

can be isolated from a tumor cell, derived from an animal 
tumor, and preferably from a human tumor. Thus, a 
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library may be prepared from, for example, a human 
adrenal tumor, but any tumor may be used. 

The immunoselection cloning method of the present 
invention comprises the preparation of a cDNA library by 
05 extracting total RNA including a particular gene from a 
cell, synthesizing a series of complementary 
double-stranded cDNA fragments from the RNA and 
introducing these cDNA fragments into mammalian cells in 
tissue culture. The mammalian cells are maintained under 
10 conditions which allow them to express the protein (i.e. 
the cell surface antigen) . The resulting cells are 
exposed to a first antibody or pool (group) of antibodies 
directed against the cell surface antigen. This results 
in formation of a cell surface antigen-first antibody 
15 complex. The complexes are exposed to a substrate to 
which is coated or bound a second antibody directed 
against the first antibody. Cells expressing the cell 
surface antigen adhere to the substrate (because of 
formation of a cell surface antigen-first antibody-second 
2 0 antibody complex) « Adherent cells are separated from 
non-adherent cells * 

Isolation of total RNA 

The guanidium thiocyanate/CsCl method of isolating 
total RNA is preferred. More preferred is a guanidium 

2 5 thiocyanate/LiCl variant of the GuSCN/CsCl method, which 
has added capacity and speed. Briefly, for each ml of 
mix desired, 0.5g GuSCN are dissolved in 0.58ml of 25% 
LiCl (stock filtered through 0.45 micron filter) and 20ul 
of mercaptoethanol is added. Cells are spun out and the 

30 pellet is dispersed on walls by flicking, add 1ml of 
solution to up to 5 x 10 cells. The resulting 
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combination is sheared by polytron until nonviscous. For 
small scale preps (less than 10 8 cells) layer 2ml of 
sheared mix on 1.5ml of 5.7M CsCl (RNase free; 1.26g CsCl 
added to every ml lOmM EDTA pH 8), overlay with 
RNase-free water and spin SW55 50krpm 2h. For large 
scale preps, layer 25ml on 12ml CsCl in a SW28 tube, 
overlay, and spin 24k rpm 8h. Aspirate contents 
carefully with a sterile pasteur pipet connected to a 
vacuum flask. once past the CsCl interface, scratch a 
10 band around the tube with the pipet tip to prevent the 
layer on the wall of the tube from creeping down. The 
remaining CsCl solution is aspirated. The pellets are 
taken up in water (do not try to redissolve) . l/io vol. 
NaOAc and 3 vol. EtOH are added and the resulting 
15 combination is spun. If necessary, the pellet is 
resuspended in water (e.g., at 70°). Adjust 
concentration to lmg/ml and freeze. Small RNA (e.g. 5S) 
does not come down. For small amounts of cells, scale 
down volumes and overlay GuSCN with RNase-free water on 
gradient (precipitation is inefficient when RNA is 
dilute) o 

Preparation of poly A* RNA 

Next, polyA + RNA may be prepared, preferably by the 
oligo dT selection method. Briefly, a disposable 
polypropylene column is prepared by washing with 5M NaOH 
and then rinsing with RNase-free water. For each milli- 
gram total RNA about' 0.3 ml (final packed bed) oligo dT 
cellulose is used. Oligo dT cellulose is prepared by 
resuspending about 0.5 ml of dry powder in l ml of o.lM 
NaOH and transferring it into the column, or by 
percolating 0.1 NaOH through a previously used column 
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(columns can be reused many times) . This is washed with 
several column volumes of RNase-free water, until pH is 
neutral, and rinsed with 2-3 ml of loading buffer. The 
column bed is then removed into a sterile 15ml tube using 

05 4-6 ml of loading buffer. The total RNA to 70 "C for 2-3 
min. , LiCl from RNase-free stock is added (to 0.5M), and 
combined with oligo dT cellulose in a 15 ml tube. This 
is followed by vortexing or agitation for 10 min. The 
result is poured into a column and washed with 3 ml 

10 loading buffer and then 3 ml of middle wash buffer. mRNA 
is eluted directly into an SW5 5 tube with 1.5 ml of 2mM 
EDTA, 0.1% SDS; the first two or three drops are 
discarded. 

Eluted mRNA is precipitated by adding 1/10 vol. 3M 
15 NaOAc and filling the tube with EtOH. This is then 

mixed, chilled for 30 minutes at -20 °C, and spun at 50k 
rpm at 5°C for 3 0 min. The EtOH is poured off and the 
tube is air dried. The mRNA pellet is resuspended in 
50-lOQul of RNase-free water. Approximately 5 ul is 
20 melted at 70° in MOPS/EDTA/f ormaldehyde and run on an 
RNase-free 1% agarose gel to check quality . 

cDNA Synthesis 

From this, cDNA is synthesized. A preferred method 

of cDNA synthesis is a variant of that described by 
25 Gubler and Hoffman ( Gene 2^:263-269 (1982)). This is 

carried out as follows: 

a. First Strand. 4 ug of mRNA and heated to 

about 100 °C in a microfuge tube for 30 seconds and 

quenched on ice. The volume is adjusted to 70 ul with 
30 RNase-free water. The following are added: 20 ul of RT1 

buffer, 2 ul of RNAse inhibitor (Boehringer 3 6 u/ul) , 1 
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ul of 5 ug/ul of oligo dT (Collaborative Research), 2.5 
ul of 20 mM dXTP's (ultrapure) , 1 ul of 1 M DTI and 4 ul 
of RT-LX (Life Science, 24 u/ul) - The resulting 
combination is incubated at 42 'C for 40 min. It is 
05 heated to inactivate (70 °C 10 min) . 

b. Second Strand. 320 ul of RNAse free 
water, 80 ul of RT2 buffer, 5 ul of DNA Polymerase I 
(Boehringer, 5 U/ul) , 2 ul RNAse H (BRL 2 u/ul) . Incu- 
bate at 15 *C for 1 hr and 22 °C for 1 hr. Add 20 ul of 
10 0.5M EDTA pH 8.0, phenol extract and EtOH precipitate by 
adding NaCl to 0.5M, linear polyacryl amide (carrier) to 
20ug/ml, and filling tube with EtOH. Spin 2-3 minutes in 
microfuge, remove, vortex to dislodge precipitate high up 
on wall of tube, and respin 1 minute. 
15 c. Adaptors. Resuspend precipitated cDNA in 

240 ul of TE (10/1) . Add 30ul of lOx low salt buffer, 
30ul of 10X low salt buffer, 30ul of 10X ligation 
additions, 3ul (2.4ug) of kinased 12-mer adaptor, 2ul 
(1.6ug) of kinased 8-mer adaptor, and 1 ul of T4 DNA 
20 ligase (BioLabs, 400 u/ul, or Boehringer, 1 Weiss 

unit/ml) . Incubate at 15 °C overnight. Phenol extract 
and EtOH precipitate as above (no extra carrier now 
needed) , and resuspend in 100 ul of TE. 

Use of cDNA fragments in expression vectors 

For use with the BstX I-based cDNA expression vectors 
of the invention, (see infra ) oligonucleotide segments 
containing terminal sequences corresponding to BstX I 
sites on the vectors are ligated to the cDNA fragment 
desired to be inserted. The resulting fragments are 
pooled by fractionation. A preferred method is as 
follows: 
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Prepare a 20% KOA, 2mM EDTA, 1 ug/ml EthBr solution 
and a 5% KOAc, 2mM EDTA, 1 ug/iul EthBr solution- Add 2-6 
ml of 2 0% KOAc solution to back chamber of a small 
gradient maker. Remove air bubble from tube connecting 

0 5 the two chambers by allowing solution to flow into the 

front chamber and then tilt back. Close passage between 
chambers, and add 2.5ml. of the 5% solution to the front 
chamber. If there is liquid in the tubing from a 
previous run, allow the 5% solution to run just to the 

10 end of the tubing, and then return to chamber. Place the 
apparatus on a stirplate, set the stir bar moving as fast 
as possible, open the stopcock connecting the two cham- 
bers and then open the front stopcock. Fill a 
polyallomer SW55 tube from the bottom with the KOAc 

15 solution. Overlay the gradient with 100 ul of cDNA 

solution. Prepare a balance tube and spin the gradient 
for 3 hrs at 50k rpm at 22 *C. To collect fractions from 
the gradient, pierce the SW55 tube with a butterfly 
infusion set (with the luer hub clipped off) close to the 

2 0 bottom of .the tube and collect three 0.5ml fractions and 

then 6 0.2 5ml fractions into microfuge tubes (about 2 2 
and 11 drops respectively) . EtOH precipitate the 
fractions by adding linear polyacrylamide to 2 0 ug/ml and 
filling the tube to the top with EtOH. After cooling 
25 tubes, spin them in a microfuge for 3 min. Vortex and 

respin 1 min. Rinse pellets with 70% EtOH (respin) . Do 
not dry to completion. Resuspend each 0.25ml fraction in 
10 ul of TE. Run 1 ul on a 1% agarose minigel . Pool the 
first three fractions, and those of the last six which 

3 0 contain no material smaller than lkb. 

Suppressor tRNA plasmids may be propagated by known 
methods. In a preferred method according to the present 
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invention, supF plasmids can be selected in nonsuppress- 
ing hosts containing a second plasmid, p3 , which contains 
amber mutated ampicillin and tetracycline drug resistance 
elements (Seed, 1983) . The p3 plasmid is derived from 

05 PR1, is 57kb in length, and is a stably maintained, 

single copy episome. The ampicillin resistance of this 
plasmid reverts at a high rate, so that amp plasmids 
usually cannot be used in p3-containing strains. Se- 
lection for tet resistance alone is almost as good as 

10 selection for ammp+tet resistance. However, spontaneous 
appearance of chromosomal suppressor tRNA mutations 
presents an unavoidable background (frequency about 10 ) 
in this system. Colonies arising from spontaneous 
suppressor mutations are usually bigger than colonies 

15 arising from plasmid transformation. Suppressor plasmids 
typically are selected for in LB medium containing amp at 
12-5 ug/ml and tet at 7-5 ug/ml- For large plasmid 
preps, M9 casamino acids medium containing glycerol 
(0.8%) may be used as a carbon source, and the bacteria 

20 grown to saturation e 

Vector DNA may be isolated by known methods. The 
following method is preferred for plasmid from 1 liter of 
saturated cells: 

Spin down cells in 1 liter J6 bottles, 4.2k rpm, 25 

25 minutes. Resuspend in 40 ml lOmM EDTA pH 8 (Thump on 

soft surface). Add 80 ml 0.2M NaOH, 1% SDS, swirl until 
clearish, viscous. Add 40 ml 5M KOAc, pH4 . 7 (2.5M KOAc, 
2.5M HOAc) shake semi -vigorously (until lumps are 2-3 mm 
in size) „ Spin (same bottle) 4.2 rpm, 5 min. Pour 

3 0 supernatant through cheesecloth into 250 ml bottle- Fill 
bottle with isopropyl alcohol. Spin J6, 4.2k rpm, 5 min. 
Drain bottle, rinse gently with 7 0% EtOH (avoid 
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fragmenting the pellet). Invert bottle , and remove 
traces of EtOH with Kimwipe. Resuspend in 3.5 ml Tris 
base/EDTA 2 0mM/10mM. Add 3.75 ml of resuspended pellet 
to 4.5g CsCl. Add 0,75 ml 10/mg/ml ethidium "bromide, 
05 mix. Fill VTi80 tubes with solution. Run at a speed of 
80 rpm for 2.5 hours or longer. Extract bands by visible 
light with 1 ml syringe and 2 0 gauge or lower needle. 
Cut top off tube, insert the needle upwards into the tube 
at an angle of about 30° with respect to the tube, (i.e., 
10 as shallowly possible) at a position about 3mm beneath 
the band, with the bevel of the needle up. After the 
band is removed, pour tube contents into bleach. Deposit 
extracted bands in 13 ml Sarstedt tube. Fill tube to top 
with n-butanol saturated with 1M NaCl, extract. If a 
15 very large quantity of DNA is obtained, reextract. 

Aspirate butanol into trap containing 5M NaOH (to destroy 
ethidium) . Add about equal volume 1M ammonium acetate to 
DNA (squirt bottle) • Add about 2 volumes 95% ethanol 
(squirt bottle) . Spin 10K rpm, 5 min. J2-21. Rinse 
20 pellet carefully with 70% ethanol. Dry with swab, or 
lyophilizer . 

The vector may be prepared for cloning by known 
methods. A preferred method begins with cutting 20 ug of 
vector in a 200 ul reaction with 100 units of BstXI (New 

25 York Biolabs) , cutting at 50 °C overnight in a 

well-thermostatted water bath (i.e., circulating water 
bath) . Prepare 2 KOAc 5-20% gradients in SW55 tubes as 
described above. Add 100 ul of the digested vector to 
each tube and run for 3 hrs, 5 OK rpm at 22 °C. Examine 

30 the tube under 300nm UV light. The desired band will 
have migrated 2/3 of the length of the tube. Forward 
trailing of the band means the gradient is overloaded. 
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Reiaove the band with a 1 ml syringe and 20 gauge needle. 
Add linear* polyacrylamide and precipitate the plasmid by 
adding 3 volumes of EtOH « Resuspend in 50 ul of TE. Set 
up ligations using a constant amount of vector and 

05 increasing amounts of cDNAs. On the basis of these trial 
ligations r set up large scale ligation, which can be 
accomplished by known methods « Usually the entire cDNA 
prep requires 1-2 ug of cut vector. 

Adaptors may be prepared by known methods , but it is 

10 preferred to resuspend crude adaptors at a concentration 
of 1 ug/ul, add MgS0 4 to 10 mM, and precipitate by adding 
5 volumes of EtOH* Rinse with 70% EtOH and resuspend in 
TE at a concentration of 1 ug/ul. To kinase take 25ul of 
resuspended adaptors, add 3ul of 10X kinasing buffer and 

15 20 units of kinase; incubate 37 °C overnight. 

Preparation of buffers mentioned in the above 
description of preferred methods according to the present 
invention will be evident to those of skill. For conve- 
nience, preferred buffer compositions are as follows: 

20 Loading Buffers 0.5 M LiCl, lOmM Tris pH 7.5, ImM 

EDTA 0.1% SDS o 

Middle Wash Buffer 0.15 M LiCl, lOmM Tris pH 7.5, ImM 

EDTA 0-1% SDS . 

Rtl Buffers 0.25 M Tris pH 8.8 (8,2 at 42°), 

25 0.25 M KC1, 30 mM MgCl 2 . 

RT2 Buffer: 0.1 M Tris pH 7.5, 25 mM MgCl , 0.5 

M KC1, 0.25 mg/ml BSA, 50 mM DTT. 

10X Low Salt 60 mM Tris pH 7.5, 60 mM MgCl , 50 

mM NaCl, 2.5 mg/ml BSA 70 mM Me. 

30 10X Ligation ImM ATP, 20 mM DTT, 1 mg/ml BSA 10 

Additions: mM spermidine. 
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10X Kinasing 0,5 M Tris pH 7.5, lOmH ATP, 2 0mM 

Buffer: DTT, 10 mM spermidine, 1 mg/ml BSA 

100 mM MgCl 2 . 

By "vector" is meant a DNA molecule, derived from a 
05 plasmid or bacteriophage, into which fragments of DNA may 
be inserted or cloned. A vector will contain one or more 
unique restriction sites, and may be capable of autono- 
mous replication in a defined host or vehicle organism 
such that the cloned sequence is reproducible- Thus, by 
10 "DNA expression vector" is meant any autonomous element 
capable of replicating in a host independently of the 
host's chromosome, after additional sequences of DNA have 
been incorporated into the autonomous element's genome - 
Such DNA expression vectors include bacterial plasmids 
15 and phages. 

Preferred for the purposes of the present invention, 
however, are viral vectors, such as those derived from 
simian virus strain 4 0 (SV4 0) - SV4 0 is a papovavirus 
having a molecular weight of 28 Mdal, and containing a 

2 0 circular double-stranded DNA molecule having a molecular 

weight of 3 Mdal, which comprises the entire genome of 
the virus- The entire nucleotide sequence of this 
single, small, covalently closed circular DNA molecule 
has been determined. Fiers et al. , Nature 273 : 113-120 
25 (1978); Reddy et al. f Science 200 : 494-502 (197S). The 
viral DNA of SV4 0 may be obtained in large quantities, 
and the genomic regions responsible for various viral 
functions have been accurately located with respect to a 
detailed physical map of the DNA, Fiers et al ■ , supra ; 

3 0 Reddy et al . , supra . The viral genome of SV4 0 can 

multiply vegetatively or as an integral part of cellular 
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chromosomes, and. a wealth of information exists on the 
replication and expression of this genome. 

Also preferred for the purposes of the present 
invention is a single- stranded bacteriophage 'cloning 
vehicle, designated M13, having a closed circular DNA 
genome of approximately 6.5 kb. An advantage of utiliz- 
ing M13- as a cloning vehicle is that the phage particles 
released from infected, cells contain single-stranded DNA 
homologous to only one of the two complementary strands 
of the cloned DNA, which therefore can be used as a 
template for DNA sequencing analysis. 

Even more preferred for the purposes of the present 
invention are the expression vectors designated piH3 , 
piH3M, and CDM8 , deposited at the American Type Culture 
15 Collection (ATCC) , 12301 Parklawn Drive, Rockville, MD 

20852. piH3 was deposited at the ATCC on , 

and has accession number ATCC . piH3M was 

deposited at the ATCC on , and has 

accession number ATCC_ . C DM8 was deposited 

20 at the ATCC on , and has accession number 

ATCC . 

By "tissue culture" is meant the maintenance or 
growth of animal tissue cells in vitro so as to allow 
further differentiation and preservation of cell archi- 

25 tecture or function or both. "Primary tissue cells" are 
those taken directly from a population consisting of 
cells of the same kind performing the same function in an 
organism. Treating such tissue cells with the 
proteolytic enzyme trypsin, for example, dissociates them 

30 into individual primary tissue cells that grow well when 
seeded onto culture plates at high densities. Cell 
cultures arising from multiplication of primary cells in 
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tissue culture are called "secondary cell cultures." 
Most secondary cells divide a finite number of times and 
then die. A few secondary cells, however, may pass 
through this "crisis period", after which they are able 
0 5 to multiply indefinitely to form a continuous "cell 

line." Cell lines often will contain extra chromosomes, 
and usually are abnormal in other respects as well. The 
immortality of these cells is a feature shared in common 
with cancer cells . 
10 Preferred cell lines for use as tissue culture cells 

according to the present invention include the monkey 
kidney cell line, designated "COS." COS cells are those 
that have been transformed by SV4 0 DNA containing a 
functional early gene region but a defective origin of 
15 viral DNA replication* COS cell clone M6 is particularly 
preferred for use according to the method of the in- 
vention. Also preferred for the purposes of the present 
invention are murine "WOP" cells, which are NIH 3T3 cells 
transfected with polyoma origin deletion DNA. cDNA may 
2 0 be introduced into the host tissue culture cells of the 

present invention by any methods known to those of skill. 
Transfection may be accomplished by, for example, 
protoplast fusion by spheroplast fusion, or by the DEAE 
dextran method (Sussman et al . , Cell. Biol. _4: 1641-1643 
25 (1984)). 

If spheroplast fusion is employed, a preferred 
method is the following variant based on Sandri-Goldrin 

^1 . , Mol. Cell Bio. 1:743-752 (1981). Briefly, for 
example, a set of six fusions requires 100 ml of cells in 
30 broth. Grow cells containing amplifiable plasmid to 
OD600=0.5 in LB. Add spectinomycin to 100 ug/ml (or 
chloramphenicol to 150 ug/ml) . Continue incubation at 
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37 °c with shaking for 10-16 hours, (Cells begin to lyse 
with prolonged incubation in spectinomycin or 
chloramphenicol medium) „ Spin down 100 ml of culture 
(JA14/GSA rotor, 250ml bottle) 5 min- at 10,000 rpm. 

05 Drain well, resuspend pellet in bottle with 5ml cold 20% 
sucrose, 50mM Tris-HCL pH 8„0o Incubate on ice 5 min- 
Add 2 ml cold 0o25M EDTA pH 8o0, incubate 5 min- at 37° C 
(waterbath) o Place on ice, check percent conversion to 
spheroplasts by microscopy- In flow hood, slowly add 

10 20ml of cold DME/10% sucrose/lOmM MgCl 2 (dropwise, ca» 2 
drops per second) . Remove media from cells plated the 
day before in 6cm dishes (50% confluent) . Add 5ml of 
spheroplast suspension to each dish. Place dishes on top 
of tube carriers in swinging bucket centrifuge. Up to 6 

15 dishes can be comfortably prepared at once. Dishes can 
be stacked on top of each other, but 3 in a stack is not 
advisable as the spheroplast layer on the top dish is 
often torn or detached after centrifugation. Spin at 
lOOOxg 10 min 0 Force is calculated on the basis of the 

20 radius to the bottom plate. Aspirate fluid from dishes 
carefully. Pipet 1.5-2ml 50% (w/w) PEG1450 (or 
PEG1000)/50% DME (no serum) into the center of the dish. 
If necessary, sweep the pipet tip around to ensure that 
the PEG spreads evenly and radially across the whole 

25 dish., After PEG has been added to the last dish, prop 
all of the dishes up on their lids so that the PEG 
solution collects at the bottom- Aspirate the PEG. The 
thin layer of PEG that remains an the cells is sufficient 
to promote fusion; the layer remaining is easier to wash 

3 0 off, and better cell viability can be obtained, than if 
the bulk of the PEG is left behind. After 9 0 to 12 0 
seconds (PEG 1000) or 120 to 150 seconds (PEG 1450) of 
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contact with the PEG solution, pipet 1.5ml of DME (no 
serum) into the center of the dish. The PEG layer will 
be swept radially by the DME. Tilt the dishes and 
aspirate. Repeat the DME wash. Add 3ml of DME/ 10% serum 
05 containing 15 ug/ml gentamicin sulfate. Incubate 4-6 

hours in incubator. Remove media and remaining bacterial 
suspension, add more media and incubate 2-3 days. 
Extensive washing of the cell layer to remove PEG tends 
to remove many of the cells without any substantial 
10 benefit. If the cells are allowed to sit in the second 

DME wash for a few minutes, most of the spheroplast layer 
will come up spontaneously; however it is preferred to 
wash briefly and allow the layer to come off in the 
complete medium at 37 °C. 
15 The PEG solution can be conveniently prepared by 

melting a fresh bottle of PEG at 60°c and pouring approx- 
imate 5 0 ml aliquots by means of a 50 ml centrifuge tube 
into preweighed bottles. The aliquoted PEG is stored at 
5°C in the dark. To make up a fresh bottle, weigh the 
20 aliquot, remelt, and add an equal volume of DME (no 

serum). Adjust the pH with 7.5% Na Bicarbonate solution 
if necessary, and filter sterilize. The resulting PEG 
solution may be stored up to 3 months at room temperature 
without detectable adverse consequence. 
2 5 Transfected host cells will be cultured according to 

the invention in order to accomplish expression of the 
protein encoded by the cDNA clone, and to increase the 
absolute numbers of cells available for subsequent 
immunoselection. Those skilled in the art will know of 
30 appropriate methods and media for this purpose, taking 

into account the cell type and other variables routinely 
considered. COS cells, for example, may be cultured in 
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Dulbecco's modified Eagle's medium (DME) supplemented 
with 10% calf serum and gentamycin sulfate. Transient 
expression of trans fected cells normally can be expected 
between 48 and 72 hours pos ttrans feet ion <, However, this 

05 time period may vary depending upon the type or strain of 
host cell used and the cell culture conditions , as will 
be apparent to those of ordinary skill* 

Immunoprecipitation, blotting, and cDfcTA sequencing 
of genes cloned according to the methods of the present 

10 invention may be carried out by any convenient methods 
known to those of skill* For example, the 
immunoprecipitation protocol of Clark et al. , Leukocyte 
Typing II , Vol. II, pp, 155-167 (1986) , is preferred* 
Southern, Northern, or other blot analysis methods known 

15 to those of skill may be employed, using hybridization 
probes prepared by known methods, such as that of Hu et 
alo ( Gene 18:271-277 (1982) cDNA sequencing also may 
be accomplished by known methods, including the 
dideoxynucleotide method of Sanger et al „ , P.N..A.S. (USA) 

20 74:5463-5467 (1977)* 

The antibodies used according to the present in- 
vention may be polyclonal or monoclonal- These may be 
used singly, or in conjunction with other polyclonal or 
monoclonal antibodies to effect immunoselection of cells 

25 expressing the desired antigen or antigens by the methods 
of the present invention • Methods of preparing anti- 
bodies or fragments thereof for use according to the 
present invention are known to those of skill. 

Standard reference works setting forth general 

30 principles of immunology include Klein, J» , Immunology: 
The Science of Self-Nonself Discrimination , John Wiley & 
Sons, publisher, New York (1982); Kennett, R. , et al, 
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eds., Laboratory Techniques in Biochemistry and Molecular 
Biology , Vol. 13, Elsevere, publisher, Amsterdam (1984). 

The term "antibody" is meant to include the intact 
molecule as well as fragments thereof, such as, for 
05 example, Fab and F(ab) 1 fragments, which also are able 

to bind to antigen. Polyclonal antibody preparations may 
be derived directly from the blood of the desired animal 
species after immunization with the antigen of interest, 
or a fragment thereof, using any of the standard proto- 
10 cols known to those of ordinary skill. Similarly, 

monoclonal antibodies may be prepared using known methods 
(Kohler et al . , Eur. J. Immunol 6:292 (1976)). Use of 
monoclonal antibodies is preferred for the purposes of 
the present invention. 
15 For the purposes of immunoselection according to the 

present invention, the tissue culture host cells which 
have been exposed to antibodies directed against the 
target cell surf ace antigen are separated from host cells 
which do not express the target antigen by distributing 
2 0 the cells onto a substrate coated with antibody directed 
against the antibody for the antigen. This technique, 
termed "panning, 11 will be known to those of skill, and is 
described, for example, by Mage et al. ., J. Immunol. 
Meth. 15:47-56 (1977), and Wysocki and Sato, P.N.A.S. 
25 (USA) 75 :2844-2848 (1978). 

Panning according to the methods of the present 
invention may be carried out as follows: 

a. Antibody-coated dishes. Bacteriological 
60mm plates, Falcon 1007 or equivalent, or 10cm dishes 
30 such as Fisher 8-757-12 may be used. Sheep anti-mouse 
affinity purified antibody (from, for example, Cooper 
BioMedical (Cappell) ) is diluted to lOug/ml in 50mM Tris 
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HC1, pH 9.5. Add 3ml per 6cm dish, or 10ml per 10cm 
dish.. Let sit ca. 1.5 hrs., remove to next dish 1.5 
hrs., then to 3rd dish. Wash plates 3x with 0.15 NaCl (a 
wash bottle is convenient for this) , incubate with 3ml 

05 Img/ml BSA in PBS overnight, aspirate and freeze. 

bo Panning. Cells will be in 60mm dishes. 
Aspirate medium from dish, add 2ml PBS/0. 5mM EDTA/0. 02% 
azide and incubate dishes at 37°C for 30 min* to detach 
cells from dish. Triturate cells vigorously with short 

10 pasteur pipet, and collect cells from each dish in a 
centrifuge tube. Spin 4 min. setting 2.5 (200 x g) 
(takes 5 min). Resuspend cells in 0.5 —1.) ml 
PBS/EDTA/azide/5% FBS and add antibodies. Incubate at 
least 30 min. on ice. Add an equal volume of 

15 PBS/EDTA/azide, layer carefully on 3 ml PBS/EDTA/azide/2% 
Ficoll, and spin 4 min. at setting 2.5. Aspirate 
supernatant in one smooth movement. Take up cells in 
OoSml PBS/EDTA/azide and add aliquots to antibody-coated 
dishes containing 3ml PBS/EDTA/azida/5% FBS by pipetting 

20 through 100 micron Nylon mesh (TetJco) o Add cells from at 
most two 60mm dishes to one 60mm antibody-coated plate. 
Let sit at room temperature 1—3 hours. Remove excess 
cells not adhering to dish by gentle washing with PBS/5% 
serum or with medium. 2 or 3 washes of 3ml are usually 

25 sufficient. 

Co Hirt Supernatant. A preferred variant of 
the method of Hirt, J. Molec. Biol . 26^:365-369 (1967), is 
as follows; Add 0.4 ml 0.6% SDS, lOmM EDTA to panned 
plate. Let sit 20 minutes (can be as little as 1 min. if 

30 there are practically no cells on the plate) . Pipet 

viscous mixture into microfuge tube. Add 0.1ml 5M NaCl, 
mix, put on ice at least 5 hrs. Keeping the mixture as 
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cold as possible seems to improve the quality of the 
Hirt. Spin 4 min. , remove supernatant carefully, phenol 
extract (twice if the first interface is not clean), add 
lOug linear polyacrylamide (or other carrier; , fill tube 
05 to top with EtOH, precipitate, and resuspend in 0.1ml. 

Add 3 volumes EtOH/NaOAc, reprecipitate and resuspend in 
0.1 ml. Transform into MC1061/p3, preferably using the 
high efficiency protocol hereinafter described- If the 
DNA volume exceeds 2% of the competent cell aliquot, the 
10 transformation efficiency will suffer. 5% gives the same 
number of colonies as 2.5% (efficiency is halved). 

It is preferred for this aspect of the present 
invention to use "blockers" in the incubation medium. 
Blockers assure that non-specific proteins, proteases, or 
15 antibodies present do not cross-link with or destroy the 
antibodies present on the substrate or on the host cell 
surface, to yield false positive or false negative 
results. Selection of blockers can substantially improve 
the specificity of the immunoselection step of the 
20 present invention, A number of non-specific monoclonal 
antibodies, for example, of the same class or subclass 
(isotype) as those used in the immunoselection step 
(e.g., IgG x , IgG^A, IgGm, etc.) can be used as blockers. 
Blocker concentration (normally l-100ug/ul) is important 
25 to maintain the proper sensitivity yet inhibit unwanted 
interference. Those of skill also will recognize that 
the buffer system used for incubation may be selected to 
optimize blocking action and decrease non-specific 
binding. 

30 A population of cells to be panned for those ex- 

pressing the target cell surface antigen is first de- 
tached from its cell culture dish (harvested) without 
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trypsin. The cells then are exposed to a first antibody, 
which may be polyclonal or monoclonal, directed against 
the antigen of interest or against a family of related 
antigens. At this initial stage, a single antibody or a 

05 group of antibodies may be used, the choice depending 
upon the nature of the target antigen, its anticipated 
frequency, and other variables that will be apparent to 
those of skill. Target antigens expressed on the sur- 
faces of host cells will form an antigen-antibody com- 

10 plex. 

The cells subsequently are placed in close 
apposition to a substrate, such as a culture dish, filter 
disc, or the like, which previously has been coated with 
a second antibody or group of antibodies. This second 

15 antibody will be directed against the first antibody, and 
its choice will be a matter of ordinary skill dictated 
by, for example, the animal in which the first antibody 
was raised o For example, if the first antibody was 
raised in mice, the second antibody might be directed 

20 against mouse immunoglobulins, raised in goats or sheep. 
Cells expressing the target antigen will adhere to the 
substrate' via the complex formed between the antigen, the 
first antibody, and the second antibody. Adherent cells 
then may be separated from nonadherent cells by washing. 

25 DNA encoding the target antigen is prepared from adherent 
cells by known methods, such as that of Hirt, J. Molec. 
BiQl - 2^:365-369 (1967) • This DNA may be transformed 
into E. coli or other suitable host cells for further 
rounds of fusion and selection, to achieve the desired 

30 degree of enrichment. 

In the usual case, the initial rounds of 
iamunoselection will employ a panel of first antibodies 
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directed against an epitope or group of epitopes common 
to the family of antigens to which the target antigen 
belongs. This will be sufficient to narrow the number of 
clones for future rounds quite significantly.' Two such 
05 rounds usually will be found adequate, but the number of 
rounds may vary as mentioned above. Thereafter, a single 
round of selection may be performed employing a single 
first antibody or a group of first antibodies recognizing 
only the target antigen, 
10 By "substrate" is meant a solid surface to which 

antibodies may be bound for iitununoselection according to 
the present invention. Known suitable substrates include 
glass, polystyrene, polypropylene, dextran, nylon, and 
other materials. Tubes, beads, microtiter plates, 
15 bacteriological culture dishes, and the like formed from 
or coated with such materials may be used. Antibodies 
may be covalently or physically bound to the substrate by 
known techniques, such as covalent bonding via an amide 
or ester linkage, or by absorption. . Those skilled in the 
2 0 art will know many other suitable substrates and methods 
for immobilizing antibodies thereupon, or will be able to 
ascertain such substrates and methods using no more than 
routine experimentation . 

The choice of host tissue culture cells for use 
25 according to the present invention preferably should be 
such as to avoid the situation in which the antibodies 
used for panning recognize determinants on untransf ected 
cells. Thus, while COS cells are preferred for transient 
expression of certain surface antigens, more preferred 
30 are murine WOP cells. Of the latter, WOP 3 027 cells are 
even more preferred. WOP cells allow virtually all 
antibodies to be used, since cross-reactions between 
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murine antibodies and murine cell surface determinants 
are rare. 

The insert size of the recombinant DNA molecule 
should be chosen to maximize the likelihood of obtaining 
05 an entire coding sequence. Those of skill will know- 
various methods by which a preliminary determination of 
optimal insert size for a given gene may be determined. 

Vector construction and cDNA insertion 

Vectors suitable for expression of cDNA in mammalian 

10 tissue culture cells may be constructed by known methods. 
Preferred for the purposes of the present invention is an 
expression vector containing the SV40 origin. The vector 
may contain a naturally derived or synthetic tran- 
scription origin , and the SV40 early region promoter * 

15 Even more preferred is a chimeric promoter composed of 

human cytomegalovirus immediate early enhancer sequences. 
Various "enhancer sequences" also may be used with SV40 
vectors. These are described , for example, by Baner j i et 
al - / Cell 27:299-308 (1981); Levinson et al. . Nature 

20 295:568-572 (1982); and Conrad et al» . Hoi. Cell. Biol. 
2:949-965 (1982). 

Insertion of cDNA into the vectors of the present 
invention can occur, for example, by homopolymeric 
tailing with terminal transferase. However, 

25 homopolymeric tracts located 5 8 to cDNA inserts may 

inhibit in vitro and in vivo expression. Thus, preferred 
for purposes of the present invention is the use of 
inverted identical cleavage sites separated by a short 
replaceable DNA segment. Such inverted identical cleav- 

30 age sites, preferably employing the Bst XI restriction 

endonuclease, may be used in parallel with cDNA synthetic 
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oligonucleotides, giving the same terminii as the re- 
placeable segment of the vector- In this manner, the 
cDNA cannot ligate to itself, but can ligate to the 
vector. This allows the most efficient use of both cDNA 
0 5 and vector* 

Another embodiment of the present invention is the 
above-described efficient oligonucleotide-based strategy 
to promote cDNA insertion into the vector. The piH3M 
vector of the present invention is preferred, and employs 
10 the inverted endonuclease sites. This vector may contain 
an SV40 origin of replication, but a more preferred form 
contains an M13 origin. This vector, containing the M13 
origin, allows high level expression in COS cells of 
coding sequences placed under its control. Also, the 
15 small size and particular arrangement of sequences in the 
plasmid permit high level replication in COS cells. 

By "cell surface antigen" is meant any protein that 
is transported through the intracellular membrane system 
to the cell surface. Such antigens normally are anchored 

2 0 to the cell surface membrane through a carboxyl terminal 

domain containing hydrophobic amino acids that lie in the 
lipid bilayer of the membrane, and there exert their 
biological and antigenic effects. Antigens such as those 
of T-lymphocytes are particularly suited for gene cloning 
25 by the method of the present invention . However, cell 

surface antigens of any cells may be cloned according to 
the present method. Moreover, proteins not normally 
expressed on the cell surface may admit of cloning 
according to the present method by, for example, using 

3 0 fluorescence activated cell sorting (FACS) to enrich for 

fixed cells expressing intracellular antigens. 
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By "substantially pure 11 is meant any antigen of the 
present invention, or any gene encoding any such antigen, 
which is essentially free of other antigens or genes, 
respectively, or of other contaminants with which it 
might normally be found in nature, and as such exists in 
a form not found in nature. By "functional derivative" 
is meant the "fragments," "variants, 11 "analogs," or 
"chemical derivatives" of a molecule- A "fragment" of a 
molecule, such as any of the antigens of the present 
invention, is meant to refer to any polypeptide subset of 
the molecule- A "variant" of such molecules is meant to 
refer to a naturally occurring molecule substantially 
similar to either the entire molecule, or a fragment 
thereof o An "analog" of a molecule is meant to refer to 
a non-natural molecule substantially similar to either 
the entire molecule or a fragment thereof „ 

A molecule is said to be "substantially similar" to 
another molecule if the sequence of amino acids in both 
molecules is substantially the same, and if both mole- 
cules possess a similar biological activity • Thus, 
provided that two molecules possess a similar activity, 
they are considered variants as that term is used herein 
even if one of the molecules contains additional amino 
acid residues not found in the other, or if the sequence 
of amino acid residues is not identical. As used herein, 
a molecule is said to be a "chemical derivative" of 
another molecule when it contains additional chemical 
moieties not normally a part of the molecule. Such 
moieties may improve the molecule's solubility, absorp- 
tion, biological half life, etc. The moieties may 
alternatively decrease the toxicity of the molecule, 
eliminate or attenuate any undesirable side effect of the 
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molecule, etc. Moieties capable of mediating such 
effects are disclosed, for example, in Remington 1 s 
Pharmaceutical Sciences , 16th ed. , Mack Publishing Co., 
Easton, Penn. (198 0) . 
05 Similarly, a "functional derivative" of a gene of 

any of the antigens of the present invention is meant to 
include "fragments," "variants," or "analogues" of the 
gene, which may be "substantially similar" in nucleotide 
sequence, and which encode a molecule possessing similar 
10 activity. 

The substantially pure antigens that have been 
expressed by methods of the present invention may be used 
in immunodiagnostic assay methods well known to those of 
skill, including radio-immunoassays (RIAs) , enzyme 
15 immunoassays (EIAs) and enzyme-linked immunosorbent 

assays (ELISAs) . The substantially pure proteins of the 
present invention, in soluble form, may be administered 
alone or in combination with other antigens of the 
present invention, or with other agents', including 
2 0 lymphokines and monokines or drugs, for the treatment of 
immune-related diseases and disorders in animals, 
including humans. As examples of such disorders that may 
benefit from treatment with the substantially pure 
proteins of the present invention may be mentioned immune 

2 5 deficiency diseases, diseases of immediate type 

hypersensitivity, asthma, hypersensitivity pneumonitis, 
immune-complex disease, vasculitis, systemic lupus 
erythematosus, rheumatoid arthritis, immunopathogenic 
renal injury, acute and chronic inflammation, hemolytic 

3 0 anemias, platelet disorders, plasma and other cell 

neoplasms, amyloidosis, parasitic diseases, multiple 
sclerosis, Guillain-Barre syndrome, acute and subacute 
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myopathic paralysis, myasthenia gravis, immune 
endocr inopathies , and tissue and organ transplant re- 
jection, all as described in Petersdorf et al a , eds., 
Harrison's Principles of Internal Medicine , supra . See 

05 also Weir, ed. , supra ; Boguslaski et al. , eds., supra ; 
and Holborow et al. , eds. , supra 0 

When used for immunotherapy, the antigens of the 
present invention may be unlabeled or labeled with a 
therapeutic agent. Examples of therapeutic agents which 

10 can be coupled to the antigens of the invention for 
immunotherapy are drugs, radioisotopes, lectins, and 
toxins . 

The dose ranges for the administration of the 
antigens of the present invention are those large enough 

15 to produce the desired immuno therapeutic effect, but not 
so large as to cause adverse side effects, such as 
unwanted cross-reactions, anaphylactic reactions, and the 
like. Generally, the dosage employed will vary with the 
age, condition, sex, and extent of the disease in the 

20 patient. Counterindications (if any) , immune tolerance 
and other variables- also will affect the proper dosage. 
Administration_ may be parenteral, by injection or by 
gradual perfusion over time. Administration also may be 
intravenous , intraparenteral , intramuscular , 

25 subcutaneous, or intradermal. 

Preparations for parenteral administration include 
sterile or aqueous or non-aqueous solutions, suspensions 
and emulsions. Examples of non-aqueous solvents include 
propylene glycol, polyethylene glycol, vegetable oils 

30 such as olive oil, and injectable organic esters such as 
ethyl oleate. Aqueous carriers include water, alcoholic 
and aqueous solutions, emulsions, or suspensions, 
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including saline and buffered media. Parenteral vehicles 
include sodium chloride solution, Ringer's dextrose, 
dextrose and sodium chloride, lactated Ringer's, or fixed 
oils. Intravenous vehicles include fluid and nutrient 
05 replenishers, electrolyte replenishers , such as those 

based on Ringer's dextrose, and the like. Preservatives 
and other additives also may be present, such as, for 
example, antimicrobials, antioxidants, chelating agents, 
inert gases and the like. Such preparations, and the 
10 manner and method of making them, are known and de- 
scribed, for example, in Remington 1 s Pharmaceutical 
Science , 16th ed. , supra . 

The antigens of the present invention also may be 
prepared as medicaments or pharmaceutical compositions 
15 comprising the antigens, either alone or in combination 
with other antigens or other agents such as lymphokines, 
monokines, and drugs, the medicaments being used for 
therapy of animal, including human, immune-related 
indications o 

2 0 Although the antigens of the present invention may 

be administered alone, it is preferred that they be 
administered as a pharmaceutical composition. The 
compositions of the present invention comprise at least 
one antigen or its pharmaceutically acceptable salt, 
25 together with one or more acceptable carriers and option- 
ally other therapeutic agents. By "acceptable" is meant 
that the agent or carrier be compatible with other 
ingredients of the composition and not injurious to the 
patient. Compositions include those suitable for oral, 
30 rectal, nasal, topical (including buccal and sublingual), 
vaginal, or parenteral administration. The compositions 
conveniently may be presented in unit dosage form, and 
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may be prepared by methods well known in the pharmaceuti- 
cal arts. Such methods include bringing into association 
the active ingredient with the carrier which constitutes 
one or more accessory ingredients- In general, composi- 

05 tions are prepared by uniformly and intimately bringing 
into association the active ingredient with liquid 
carriers or finely divided solid carriers, or both, and 
shaping the product formed thereby, if required* 

Orally administered pharmaceutical compositions 

10 according to the present invention may be in any conve- 
nient form, including capsules, cachets, or tablets, each 
containing a predetermined amount of the active ingredi- 
ent* Powders or granules also are possible, as well as 
solution or suspension in aqueous or nonaqueous liquids, 

15 or oil-in-water liquid emulsions, or water-in-oil liquid 
emulsions. The active ingredient also may be presented 
as a bolus, electuary or paste. 

Having now described the invention, the same will be 
more fully understood by reference to the following 

20 examples, which are not intended in any way to limit the 
scope of the invention - 

EXAMPLE I Isolation, Molecular Cloning, and Structure of 
the Human CD2 Antigen 

The cDNA expression vector piH3 

25 A C °S cell expression vector was constructed from 

piSV (Little et al . , MoL Biol. Med, 1:473-488 (1983)) by 
inserting a synthetic transcription unit between the 
suppressor tRNA gene and the SV4 0 origin . The tran- 
scription unit consisted of a chimeric promoter composed 

3 0 of human cytomegalovirus AD169 immediately early enhancer 
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sequences fused to the HIV LTR -67 to +80 sequences. 
Immediately downstream from the LTR +8 0 sequence was 
inserted a polylinker containing two Bst XI sites separat- 
ed fay a 350bp stuffer; the Bst XI sites were flanked by 
05 Xbal sites, which could also be used to excise the 

insert. Downstream from the polylinker were placed the 
SV4 0 small t antigen splice and early region 
polyadenylation signals derived from pSV2 . The 
nucleotide sequence of the vector is shown in Figure 1. 

10 cDNA library construction 

RNA was prepared from HPB-ALIj cells by the 
guanidinium thiocyanate/CsCl method, as described above. 
PolyA 4 " RNA was prepared from total RNA by oligo dT 
selection, Maniatis et al , Molecular Cloning: A Labo- 
15 ratory Manual , supra . cDNA was synthesized by the method 
of Gubler and Hoffman ( Gene 25^:263-269 (1982)). Bst XI 
adaptors were ligated to the cDNA, and the reaction 
products fractionated by centrif ugation through a 5ml-2 0% 
potassium acetate gradient containing ImM EDTA for 3 
20 hours at SOkrpm in a SW55 rotor. 0.5ml fractions were 

collected manually through a syringe needle or butterfly 
inserted just above the curve of the tube. Individual 
fractions were ethanol-precipitated after addition of 
linear polyacrylamide (Strauss and Varshavsky, Cell 
25 37:889-901 (1984)) to 20 ug/ml \ Fractions containing 

cDNA larger than 700bp were pooled and ligated to gradi- 
ent purified Bst XI digested piH3 vector. 

The ligated DNA was transformed into E. coli 
MC1061/p3 made competent by the following protocol: The 
30 desired strain was streaked out on an LB plate. The next 
day a single colony was inoculated into 20ml TYM broth 
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(recipes below) in a 250ml flask, The cells were grown 
to midlog phase (OD^ 0Q about 0.2-0.8), poured into a 21 
flask containing 100ml TYM r and vigorously agitated until 
cells grew to 0.5-0/9 OD, then diluted again to 500ml in 
05 the same vessel. When the cells grew to OD- rtrt 0.6, the 

QUO 

flask was placed in ice-water, and shaken gently to 
assure rapid cooling,, When the culture was cool, it was 
spun at 4«2k rpm for 15 minutes (J6) * The supernatant 
was poured off and the pellet resuspended in about 100ml 

10 cold TfB I (below) by gentle shaking on ice- Thereafter, 
it was respun in the same bottle at 4*2krpm for 8 minutes 
(J6) o The supernatant was poured off and the pellet 
resuspended in 2 0 ml cold TfB II by gentle shaking on 
ice. 0.1 to 0.5 ml aliquots were placed in prechilled 

15 microfuge tubes, frozen in liquid nitrogen, and stored at 
— 70 °Co For transformation, an aliquot was removed, 
thawed at room temperature until just melting, and placed 
on ice. DNA was added, let sit on ice 15-3 0 minutes, and 
incubated at 37 °C for 5 minutes (6 minutes for 0.5ml 

20 aliquots) o Thereafter the DNA-containing suspensions 

were diluted 1:10 in LB and grown for 90 minutes before 
jplating or applying antibiotic selection. Alternatively, 
the heat-pulsed transformation mix was plated directly on 
antibiotic plates onto which a thin (4-5ml) layer of 

25 antibiotic- free LB agar was poured just before plating. 

Media and Buffers; TYM: 2% Bacto-Tryptone, 0.5% 
Yeast Extract, 0.1M NaCl, lOmM Mgs0 4 (can be added before 
autoclaving) . TfB I: 30mM KOAc, 50mM MnCl 2 , lOOmM KCL, 
lOmM CaCl 2 , 15% (v/v) glycerol. TfB II: lOmM Na-MOPS, 

3 0 pH 7.0, 75 MM CaCl 2 , lOmM KCl, 15% glycerol. 
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Recovery of cDNA clones by panning 

Bacteriological culture dishes (Falcon 1007) were 
prepared for panning by coating with an affinity purified 
sheep anti-mouse IgG antibody as described by Wysocki and 
05 Sato ( Proc. Natl. Acad. Sci. USA 75 :2844-2848 (1978)), 
except that dishes were washed with 0 . 15M NaCl from a 
wash bottle instead of PBS, and unreacted sites were 
blocked by overnight incubation in PBS containing Img/ml 
BSA. Dishes were typically prepared in large batches and 
10 stored frozen, after aspiration of the PBS/BSA. In the 
first round of screening, 24 6cm dishes of 50% confluent 
COS cells were transfected by protoplast fusion according 
to the method of Sandri-Goldrin et al . , Mol . Cell Biol. 
1: 743-752 (1981) . 72 hours post fusion the cells were 
15 detached by incubation in FBS/lmM EDTA/.02% sodium azide 
at 37 °c for 3 0 minutes. The detached cells were pooled, 
centrifuged, and resuspended in cold PBS/EDTA/5% Fetal 
Bovine Serum containing monoclonal antibodies, usually as 
ascites at Is 1000 dilution, but also as commercial 
20 reagents at the concentrations suggested by the manufac- 
tures. After 1 hour on ice, the cells were diluted with 
1:1 with PBS/EDTA/azide and layered on 10ml of PBS/EDTA/ 
azide containing 2% Ficoll 400 o After centrifugation 
(4 00xg, 5 minutes) , the supernatant was carefully as- 
25 pirated, the pellet resuspended in a small amount of 

PBS/EDTA/5% FBS, and the cells distributed into panning 
plates containing 3 ml of PBS/EDTA/5% FBS. The plates 
were then treated essentially as described by Wysocki and 
Sato. Proc. Natl. Acad. Sci. USA 75:2844-2848 (1978) . 
3 0 Episomal DNA was recovered from the adherent cells by the 
Hirt (j\ Mol. Biol. 2_§ :365 " 269 (1967)) procedure and 
transformed into MC1061/p3. 
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Cell lines and cell culture 

COS cell clone M6 cells were propagated in 
Dulbecco's modified Eaglet medium supplemented with 10% 
calf serum and gentamycin sulfate at 15ug/ml "( DM E/l°Sr 

05 calf serum) . Cells were split the day before 

transfection in 6 cm dishes at approximately 1:8 ratio 
from stock plates kept as dense as possible without 
overtly affronting the cells „ T cell lines were grown in 
Iscove's modification of Dulbecco's medium (IMDM) con- 

10 talning gentamycin as above, and either NuSerum (Collab- 
orative Research) or fetal bovine serum at 10% • 



COS cell transfection for immunofluorescence studies 
COS cells at 50% confluence in 6 cm dishes were 
transfected in a volume of 1- 5ml with a cocktail consist- 

15 ing of DME or IMDM medium containing 10% NuSerum (Collab- 
orative Research) , 400ug/ml DEAE Dextran, lOuM 
chloroquine diphosphate, and lug/ml DNA. After 4 hours 
at 37 *C (or earlier if the cells appeared ill), the 
transfection mix was removed and the cells were treated 

20 with 10% DMSO in PBS for 2 minutes. Sussman and Milman, 
Cell Biol. At 1641-1643 (1984) - Cells were then returned 
to DME/10% calf serum for 48 to 72 hours to allow ex- 
pression. 

Immunoprecipitations , Northerns and Southerns 
25 T cells were labeled by lactoperoxidase treatment, 

lysed, and immunoprecipitated by the procedure of Clark 
and Einfeld ( Leukocyte Typing II , Vol, II, pp. 155-167 
(1986)), using commercially available goat anti-mouse IgG 
agarose beads (Cooper Biomedical) - COS cells were 
3 0 transfected by DEAE Dextran method and trypsinized and 
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passed without dilution into new plates 24 hours after 
transfection. 3 6 hours later, cells were detached by 
exposure to PBS/EDTA as above, centrifuged, and labeled 
by the lactoperoxidate method . A cleared lysate was 

05 prepared as for the T cell immunoprecipitations , except 
that the lysis buffer contained ImM PMSF, and incubation 
with the primary antibody was carried out for only 2 
hours at 4°C. Eluted samples were fractionated on 
discontinuous 11.25% polyacrylamide gels using the buffer 

10 system of Laemmli (Nature 227 : 680-685 (1970)). 

Northern blot analysis was carried out essentially 
as described (Maniatis et al . , Molecular Cloning, a 
Laboratory Manual (1982)), except that DMSO was omitted 
from the loading buffer, denaturation was at 70 ° C for 5 

15 minutes, and the gel contained 0-6% formaldehyde rather 
than 6%. The gel was stained in two volumes of water 
containing lug/ml ethidium bromide, photographed, and 
transferred to nylon (GeneScreen, DuPont) in the staining 
liquor- The transferred RNA was irradiated by exposure 

2 0 to a germicidal lamp through Saran Wrap (Church and 

Gilbert, Proc. Natl. Acad. Sci . USA J3: 1991-1995 (1984)) 

for 5 minutes at a flux (measured at 254nm) of 
2 

0„22mW/cm . Southern blot analysis was carried out by 
alkaline transfer to nylon (GeneScreen, DuPont) as 

25 described by Reed and Mann ( Nucl. Acids Res. 13 : 7207-7221 
(1986)). Hybridization probes were prepared by the 
method of Hu and Messing (Gene 182271-277 (1982)), and 
blots were prehybridized in SDS/phosphate buffer (Church 
and Gilbert, Proc. Natl. Acad. Sci, USA _8: 1991-1995 

30 (1984)) containing 10 DNA microgram equivalents of M13 
mpl9 phage. 
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Erythrocyte Rosetting 

Erythrocytes were prepared from whole blood by three 
centrif ligations in PBS. COS cells were transfected in 6 
cm dishes with CD2 or other surface antigen expression 
05 clones by the DEAE method. 48 to 72 hours 

posttransfection, the medium was aspirated and 2ml of 
PBS/5% FDS/azide was added to each plate , followed by 
0.4ml of the appropriate erythrocyte samples as 20% 
suspensions in PBS« After 1 hour at room temperature, 
10 the nonadherent erythrocytes were gently washed off, and 
the plates examined «, 

A cDNA encoding CD2 antigen determinants was 
isolated in the following manner: cDNA was prepared from 
RNA extracted from the human T Cell tumor line HPB-AI*Ii 
15 and inserted into the SV4 0 origin-based expression vector 
piH3 as described above „ A cDNA library of approximately 
3 ^ 10 5 recombinants was constructed, and the library was 
introduced into COS cells by protoplast fusion. Three 
days later the cells were detached by exposure to EDTA 
20 and treated with a pool of monoclonal antibodies, 

including three (OKT11, Le U 5b, and Coulter Til) directed 
against CD2 determinants. The antibody-treated cells 
were distributed into dishes coated with an affinity 
Purified sheep anti-mouse igG antibody, allowed to 
25 attach, and separated from the nonadherent cells by 

gentle washing. This method of enrichment is known in 
the immunological literature (Mage et al . , J. Immunol, 
Methods 15:47-56 (1977) o 

The resulting colonies were pooled, fused into COS 
3 0 cells, and subjected to a second round of panning as 

before. In the third round, a portion of the detached 
cells was treated with a mixture of three "monoclonal 
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antibodies specific for CD2 , and a Hirt supernatant was 
again generated and transformed into E. coli , DNA was 
prepared from eight of the resulting colonies and 
transfected into COS cells. After three days*, surface 

0 5 expression of the CD2 antigen was detected by indirect 
immunofluorescence in six of eight transfected dishes . 
Restriction enzyme digestion of the corresponding plasmid 
DNAs revealed a l.Skb insert in all six isolates. 

One of the six clones was prepared in larger quan- 

10 tities for further analysis. Following transfection into 
COS cells, indirect immunofluorescence analysis with a 
partial panel of antibodies provided by the Third Inter- 
national Workshop on Leukocyte Differentiation Antigens 
showed that all of the antibodies provided gave positive 

15 reactions with the exception of one sample which also 
failed to react with phytohemagglutinin-activated T 
lymphocytes. Among the 17 antibodies tested were at 
least eight distinguishable groups defined by their 
differing patterns of reactivity with lymphocytes of 

20 various primate species o Jonker and Nooij , Leukocyte 
Typing II , Volo I , pp. 373-387 (1986) . 

cDNA sequence analysis 

The CD2 cDNA insert was subcloned into M13 mpl9 
(Vieira and Messing, Gene 19:259-268 (1982)) in both 

25 orientations, and the sequence determined by the 

dideoxynucleotide method (Figure 2). Sanger et al . , 
Proc. Natl. Acad. Sci. USA 74:5463-5467 (1977) • An open 
reading frame was observed to extend 3 60 residues from an 
ATG triplet satisfying the consensus criteria of Kozak 

30 ( Microbiol . Rev. : 1-47:45 (1983)) for translat ional 

initiation codons (Figure 1) . The predicted amino acid 
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sequence evokes an integral membrane protein with a 
single membrane spanning hydrophobic anchor terminating 
in a rather large intracytoplasmic domain „ Comparison of 
the N— terminal amino sequence with the matrix of signal 

05 sequence" residue frequencies constructed by von Heijne 
( Nuclo Acids Res. 14 :4683-4690 (1986)) suggests that 
mature CD2 peptide is formed by cleavage of a precursor 
peptide between the 19th (Ser) and 20th (Lys) residues. 

A surprising and unexpected feature of this sequence 

10 'is the presence of a potential N-linked glycosylation 
site ju£t proximal to the proposed cleavage site. The 
resulting polypeptide backbone has a predicted molecular 
weight of 38„9kd divided into an external domain of mass 
21o9kb and a cytoplasmic domain of mass 14,6kd. Three 

15 N-linked glycosylation sites are present in the extra- 
cellular 4 - domain. 

The- -membrane spanning domain comprises 26 unchanged 
residues- of predominantly hydrophobic character. In the 
nine residues immediately following are seven basic 

20 residue^ either lysines or arginines* The appearance of 
predominantly hydrophobic residues followed by basic 
residues" is a common organizational feature of 
transmembrane proteins bearing carboxyl -terminal anchors. 
Another surprising feature of the transmembrane 

25 domain is the appearance of a cys-gly-gly-gly , a beta 
turn motif (Chou and Fasman, Annual Review of 
Biochemistry. 47:251-276 (1978)), flanked by hydrophobic 
residues (which are frequently found flanking beta 
turns) . . . Because only 20 residues arrayed in an alpha 

3 0 helix are theoretically needed to traverse the 3nm 
membrane bilayer (Tanford, Science , 200 :1012-1018 
(1978) ), and as few as 14 hydrophobic residues can allow 
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insertion and export of an integral membrane protein 
(Adams and Rose, Cell , 41:1007-1015 (1985)), the trans- 
membrane segment of the CD2 antigen may contain a bend or 
kink. 

05 The rather large size of the cytoplasmic domain 

leaves open the possibility that CD2 possesses an 
intrinsic enzymatic activity. The cytoplasmic domain is 
very rich in prolines and contains three sites with high 
turn probability. 
10 Comparison of the amino acid sequence with the NBRF 

database revealed no substantive homologies with other 
proteins. In particular, no homology with the T cell 
receptor alpha or beta chains was observed, ruling out 
the suggestion that CD2 is a primordial T cell receptor. 
15 Milanese et al. , Science , 231 : 1118-1122 (1986). 

Just inside the cytoplasmic face of the protein is a 
run of basic proteins followed by a serine residue, a 
pattern found at the same location in both the EGF 
receptor and the class I histocompatibility genes, iand in 
2 0 each case a known site for either in vivo (EGF) and in 

vitro (HLA) phosphorylation by protein kinase C or cyclic 
AMP-dependent protein kinase, respectively. Hunter et 
al. , Nature, 311 :480-482 (1984); Davis and Czech (1985); 
Guild and Strominger (1984) . A similar site is found in 
25 the intracytoplasmic domain of the inter leukin 2 

receptor, and is phosphorylated in vivo by protein kinase 
C Leonard et al. , Nature, 311 : 626-631 (1984); Nikaido 
et al.. , Nature, 311 :631-635 (1984); Shackelford and 
Trowbridge, J. Biol. Chem. , 259:11706- (1984). 
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Immunoprecipitation of CD2 antigen expressed by 
transfected cells 

COS cells were transfected with the CD2 expression 
plasmid and surface labeled with 125^ by the * 

05 lactoperoxidase method 60 hours post-transf ection . A 
cell lysate was prepared, and portions were incubated 
either with monoclonal anti-CD2 antibody (OKT11) or with 
an extraneous (OKT4 r anti-CD4) antibody for 2 hours at 
4°C_ Sepharose— bound anti-mouse antibody was added, and 

10 after several washing steps, the adsorbed proteins were 
eluted and electrophoresed through a 11.25% acrylamide 
gel together with similarly prepared immunoprecipitates 
from phytohemagglutinin-activated T lymphocytes, the cDNA 
donor line HPB— AIjIi, or a long— term T cell line generated 

15 in this laboratory" Autoradiography demonstrated a 

prominent band of immunoreactive material precipitated 
from transfected cos cells by the anti-CD2 antibody, but 
not by the control » The calculated mean molecular weight 
of the COS cell material was 51kd, compared to a mean 

2 0 molecular weight of 5.4kd for the T blast and T cell line 
material? the antigen from HPB-ALIi cells was found to 
have a molecular weight of approximately 61kd« The 
observed differences in size were attributed to different 
patterns of glycosylation in the different cell types * A 

25 minor band of apparent molecular weight 3 8kd was present 
in material immunoprecipitated from COS cells but not 
from T cells or HFB-ALL cells « The size of this species 
agrees within experimental error with the predicted 
molecular weight of mature unglycosylated peptide, 3 9kd. 



30 COS cells expressing CD2 form rosettes with sheep 



erythrocytes 

COS cells transfected with the CD2 expression clone 
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were treated for 1 hour with purified MT910 (IgG, kappa) 
anti-CD2 antibody (Rieber et al . , Leukocyte Typing II , 
Vol. I, pp. 233-242 (1986)) at a concentration of lug/ml, 
or with purified MB40.5 (IgGl, kappa; Kawata et al . , J. 

05 Exp . Med . , 160 : 633-651 (1984)) antibody at the same 

concentration. MB4 0.5 recognizes a monomorphic H LA —ABC 
determinant and Cross-reacts with African Green Monkey 
histocompatibility antigens; it was chosen because it 
represents an isotype-matched antibody recognizing a 

10 surface antigen of approximately the same abundance as 
the CD2 antigen expressed by transfected cells. Sheep 
erythrocyte rosettes were observed in the presence of 
MB40.5, but not of MT910. Rosette inhibition was also 
observed with OKT11 antibody, and not with various other 

15 control antibodies. 

Transfected COS cells form rosettes with other animal 
erythrocytes 

In addition to sheep erythrocytes, human T cells are 
known to form rosettes with horse, pig, dog, goat, and 
2 0 rabbit, but not mouse or rat erythrocytes. Johansen et 

al. , J. Allergy Clin. Immunol. , 54 : 86-94 (1974); Amiot et 
al . in, A. Bernard et al . , eds., Leucocyte Typing , 
Springer, publisher, New York, N.Y., pp. 281-293 (1984); 
Nalet and Fournier, Cell . Immunol . , 96:126-136 (1985). 

2 5 Autorosettes between human erythrocytes and human 

thymocytes (Baxley et al . , Clin. Exp. Immunol. , 
15^:385-393 (1973)) have also been reported. COS cells 
transfected with the CD2 expression clone were treated 
with either MT910 or with the control antibody, MB40.5, 

3 0 and exposed to erythrocytes from the species above. 
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Rosettes were observed with horse, pig, dog, goat, sheep, 
rabbit, and human erythrocytes, but not with mouse or rat 
erythrocytes. Rosette formation was blocked by 
pretreatment of transfected COS cells with MT910, but not 

05 with MB40o5o In these experiments, it was noticed that 
horse erythrocytes formed unusually dense rosettes, and 
that goat erythrocytes formed rather sparse rosettes, 
possibly because their small size made them more 
susceptible to washing • Mouse erythrocytes showed weak 

10 spontaneous binding to the culture dish as well as to 

MT910 and MB4CK5 pretreated cells , while rat erythrocytes 
showed no detectable binding of any sort. 

Binding- of human erythrocytes is blocked by LFA3 antibody 
Because it has been suggested on the basis of 

15 antibody blocking studies that IiFA3 is the target 
structure for the CD2 antigen (Shaw et al . , Nature, 
323 ;262-264 (1986) ) r the ability of anti-LFA3 antibody to 
prevent rosette formation was investigated. Transfected 
cells were exposed to human erythrocytes pretreated for 2 

20 hours with either anti-LFA3 (IgGl, kappa) as ascites at 
1:1000 dilution, or with a lOug/ml concentration of each 
of four isotype-matched nonagglutinating antibodies 
directed against human erythrocyte antigens as prevalent 
or more prevalent than LFA3:G10/B11 and D10, anti-K14 

25 antigen, D6, anti-Wr b antigen; and F7/B9, anti-k antigen. 
Nichols et al. f Vox Saner , in press. The erythrocytes 
were washed free of excess LFA3 antibody, but were 
allowed to form rosettes in the presence of the control 
antibodies to guard against possible loss of antibody 

30 blocking power by desorption. Rosette formation was 
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observed in the presence of all four control antibodies, 
but not with erythrocytes pretreated with anti-LFA3 - 

COS cells expressing other T cell antigens do" not form 
rosettes 

05 A number of clones were isolated by the same 

expression technique used to clone CD2 and characterized 
to varying degrees by antibody reactivity, nucleic acid 
restriction and sequence analysis, and immunoprecipi- 
tation. Representative clones were trans feet ed into COS 

10 cells and analyzed for ability to sustain rosette 

formation. The CDla, CDlb, CDlc, CD4 , CDS, CD6 , CD6 , 
CD8, and CD28 (Tp44) clones did not form rosettes with 
human erythrocytes • 

RNA blot analysis 
15 Equal amounts of total RNA prepared from cell types 

expressing or lacking CD2 antigen were electrophoresed 
through denaturing agarose gels and transferred to nylon. 
Hybridization of the transferred RNA with a strand 
selective probe (Hu and Messing, Gene , 17:271-277 (1982)) 
2 0 prepared from an M13 clone containing a CD2 cDNA insert 
revealed the presence of prominent 1.65 and 1.3 kb 
transcripts present in RNA derived from thymocyte, 
activated T cell, and senescent T cell populations. 
Lesser amounts were found in RNA extracted from the cDNA 
25 donor line, HPB-ALL and less still from MOLT4 ; barely 
detectable levels were recorded in RNA from the HS-B-2 
line. No reactivity was observed with RNA from Namalwa 
(Burkitt lymphoma) , U937 (histiocytic leukemia) , HuT-78 
(Adult T cell leukemia) , PEER (T cell leukemia) , or 
30 Jurkat clone J3R7 (T cell leukemia) lines.' The pattern 
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of reactivity conformed well with the known or measured 
pattern of expression of CD2 antigen, which was absent or 
indetectable on the Namalwa, *937, HuT-78, J3R7, PEER, 
and HSB-2 cell lines, weakly present on MOLT 4*, more 

05 strongly present on EPB-ALL, and most strongly present on 
activated T cells- Thymocytes axe also known to express 
high levels of CD2 antigen* 

Examination of the sequence of the cDNA clone 
suggested that the X<,3kb RNA might arise by formation of 

10 an alternate 3 1 end distal to the canonical 

polyadenylation signal AATAAA at position 1085 in the 
cDNA sequence o To test this notion, RNA from HPB-ALL and 
activated T cells was subjected to Northern blot analysis 
and hybridized either with a complete cDNA probe, or with 

15 a probe derived from the 3 11 portion of the cDNA distal to 
nucleotide 113 L The latter probe reacted only with the 
l-65kb species, while the former showed the same 
reactivity pattern observed in Figure 5- This result is 
consistent with the suggested origin of the 1.3kb 

20 transcript o 

In both activated and senescent T cell RNA prepara- 
tions, a weakly hybridizing transcript of approximately 
0.75kb was detected- At present the origin of this RNA 
is unknown* 

25 Genomic organization of the CD2 crene 

Southern blot analysis of genomic DNA from placenta, 
peripheral blood lymphocytes, T cells, HeLa cells, or the 
tumor lines used in the RNA analysis above showed 
identical BamHI digest patterns, indicating that 

3 0 rearrangement is not involved in the normal expression of 
the CD2 gene during development. Similarly, no gross 



WO 89/08114 



PCT/US88/02809 



-61- 

genomic alteration underlies the failure of the examined 
T cell tumor lines to express CD2 antigen* Restriction 
analysis of total -genomic DNA with a number of other 
enzymes, as well as preliminary results with an 
05 incomplete collection of 1 phage recombinants bearing the 
CD2 sequence, shows that the gene is divided into at 
least four exons „ 

EXAMPLE II Isolation and Molecular Cloning of 

Human LFA-3 Antigen 

10 The previous example shows that cDNAs encoding 

surface antigens, such as the CD2 antigen, can be 
isolated by the transient expression system of the 
present invention, in which COS cells transfected with 
cDNA libraries are allowed to attach to ("panned" on) 
15 antibody-coated plates* Plasmid DNA is recovered from 
cells adhering to the plates , transformed into E. coli , 
and the process is repeated, usually twice, to isolate 
the desired clone. Although powerful, this approach 
cannot be used when the monoclonal antibodies used for 
20 panning recognize determinants on the untransf ected 
cells. This appears to be the case for anti-LFA3 
monoclonal TS2/9. However, a similar transient 
expression system based on polyoma virus replication- 
competent cells should allow almost all monoclonals to be 
25 used, since the probability of cross reaction between 
murine antibodies and murine cell surface determinants 
should usually be small. 

A new expression vector, CDM8 (Figure 3) was created 
from the COS cell vector piH3M described previously. The 
30 new vector differs by the inclusion of a deleted version 
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of a mutant polyoma virus early region selected for high 
efficiency expression in both murine and monkey cells, by 
the replacement of substantially all of the human 
immunodeficiency promoter region with the cognate 

05 sequences of the human cytomegalovirus immediate early 

promoter, and by inclusion of a bacteriophage T7 promoter 
between the eukaryotic promoter and the site of cDNA 
insertion o Expression in COS cells of chloramphenicol 
acetyltransf erase by all of the vectors was equivalent, 

10 A library of 1.9 x 10 6 recombinants having inserts 

greater than 0„8kb in size was prepared in the CDM8 
vector from a microgram of poly A+ RNA isolated from the 
human lymphoblastoid cell line JY„ The library was 
introduced into WOP cells (NIH 3T3 cells transfected with 

15 polyoma origin deletion DNA) by spheroplast fusion, and 
subjected to three rounds of panning and reintroduction 
into Eo coli as described in Example L 

A clone encoding the LFA-3 antigen was identified by 
indirect immunofluorescence of transfected WOP cells, 

20 amplified and sequenced (Figure 4) „ Within the 874 bp 

insert, an open reading frame of 237 residues originates 
at a methionine codon closely corresponding to the 
consensus sequence suggested by Kozak, Microbiol. Rev. , 
47:1-45 (1983) o The reading frame terminates in a series 

25 of hydrophobic residues lacking the characteristic basic 
anchoring residues of internal membrane proteins, but 
sharing features with known phosphatidylinositol-linked 
superficial membrane proteins. The features include 
clustered serine or threonine residues in a hydrophilic 

30 region immediately preceding the hydrophobic domain, and 
the presence of serines and threonines in the hydrophobic 
portion. 
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The amino acid sequence predicted from the 
nucleotide sequence of the LFA— 3 clone was compared to 
the NBRF database/ and no significant homologies were 
uncovered; the most significant scores were to the HIV 

05 envelope protein. Within the 200 residues comprising the 
presumed mature protein are 6 N-l inked glycosylation 
sites, and 5 tandem serine or tandem threonine residues 
that frequently appear in O-l inked glycosylated proteins. 
Ten cysteine residues appear in the complete sequence, 6 

10 of which are distributed in the latter half of the mature 
protein, and one of which falls in the carboxy-terminal 
hydrophobic domain. Although esterif ication of cysteine 
thiols to fatty acids is a common occurrence in integral 
membrane proteins, and may play an alternate role in 

15 membrane anchoring of LFA— 3 , two examples are known of 
cysteine residues within or at the margin of the 
hydrophobic region of phosphatidylinositol linked 
proteins. 

The predicted sequence suggests that the known 
2 0 manipulations for increasing erythrocyte adhersion to T 
cells may find direct physical explanation in the 
structure of the LFA- 3 molecule, Aminoethylisothiouroni- 
um bromide, the thiourea adduct of bromoethyl amine , 
undergoes spontaneous rearrangement to mercaptoethyl- 

2 5 guanidine at alkaline pH. The latter likely gains access 

to disulfide bonds inaccessible to less chaotropic 
reducing agents and may thereby reduce and promote the 
unfolding of the LFA- 3 molecule. Similarly, 
neuraminidase may decrease steric interference by the 

3 0 many carbohydrate chains on the molecule. 

RNA and DNA blot hybridization analysis showed that 
the LFA -3 gene shares no closely related sequences in the 
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genome, and encodes a single RNA species of about 1 kb in 
length. Cell lines that express large amounts of surface 
LFA-3 have greater amounts of LFA-3 RNA than those that 
express small or nondetectable amounts. 
05 Radioimmunoprecipitation of the antigen expressed in 

trans fected COS and murine cells shows a broad band of 
approximately 50kd mean molecular mass, similar to that 
found in JY cells. 

EXAMPLE III Isolation and Molecular Cloning of the 
10 Human CD28 cDNA Antigen 

The previous examples illustrate the monoclonal 
antibody-based technique of the present invention for 
enrichment of cDNAs encoding surface antigens. In the 
present example, a method of constructing plasmid 

15 expression libraries is described which allows the 

enrichment technique to be fully exploited. The method 
of the present invention for making plasmid expression 
libraries is of general use for expression cloning. 

The antibody selection technique of the present 

20 invention has also been applied to isolate a cDNA clone 
encoding the CD28 antigen. The antigen shares 
substantial homology with members of the immunoglobulin 
superfamily and forms a dimer structure on the surface of 
trans fected COS cells similar to the dimer structure 

25 found on T lymphocytes. 

Preparation of cDNA Libraries. Poly (A) + rna was prepared 
from the human T-cell tumor line HPB— ALL by oligo(dT) 
cellulose chromatography of total RNA isolated by the 
guanidinium thiocyanate method (Chirgwin, J.M. etaL, 
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Biochemistry , 18:5294-5299 (1979)). cDNA was prepared by 
a protocol based on the method of Gubler and Hoffman 
(Gubler, U. et al.' , Gene , 25:263-269 (1982)). 4ug of 
mRNA was heated to approximately 10 0°C in a 1.5 ml 
05 centrifuge tube for 3 0 seconds, quenched on ice, and the 
volume adjusted to 7 0ul with RNAse-free water. To this 
were added 20ul of buffer (0,25 M Tris pH 8,8 (8.2 at 
42°C) , 0.25 M KC1, 30 mM MgCl 2 >, 2ul of RNAse inhibitor 
(Boehringer 3 6 u/ul) , lul of 1M DTT, lul of 5ug/ul of 
10 oligo dT (Collaborative Research) , 2ul of 25 mM each 

deoxynucleoside triphosphate (US Biochemicals) , and 4ul 
of reverse transcriptase (Life Sciences, 24 u/ul) . After 
4 0 minutes at 42 °C, the reaction was terminated by 
heating to 7 0°C for 10 minutes. To the reaction mix was 
15 then added 320ul of RNAse free water, 80ul of buffer (0,1 
M Tris pH 7.5, 25 mM MgCl 2 , 0.5 M KC1, 0.25 mg/ml BSA, 
and 50 mM DTT) , 25 units of DNA Polymerase I 
(Boehringer) , and 4 units of RNAse H (BRL) . After 1 hour 
at 15°C and 1 hour at 22°C, 20 ul of 0.5M EDTA pH 8.0 
2 0 were added, the reaction mixture was extracted with 

phenol, NaCl was added to 0.5 M, linear polyacrylamide 
. (carrier; Strauss, F. et al. , Cell , 2Z :889 " 901 (1984)) 
was added to 20 ug/ml, and the tube was filled with, 
ethanol. After centrifugation for 2-3 minutes at 12,000 
2 5 x g, the tube was removed, vortexed to dislodge 

precipitate spread on the wall of the tube, and respun 
for 1 minute. 

Unpurified oligonucleotides having the sequence 
CTCTAAAG and CTTTAGAGCACA were dissolved at a 
30 concentration of 1 mg/ml, MgSO^ was added to 10 mM, and 
the DNA precipitated by adding 5 volumes of EtOH. The 
pellet was rinsed with 70% ETOH and resuspended in TE at 
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05 



10 



15 



a concentration of 1 mg/ml. 25 ul of the resuspended 
oligonucleotides were phosphorylated by the addition of 3 
ul of buffer C0.5-M Tris pH 7.5, 10 inM ATP, 20 mM DTT, mM 
spermidine, 1 mg/ml BSA, and 10 mM MgCl.,) and 2 0 units of 
polynucleotide kinase followed by incubation at 37°c 
overnight . 

3 ul of the 12-mer and 2 ul of the 8-mer 
phosphorylated oligonucleotides were added to the cDNA 
prepared as above in a 300 ul reaction mixture containing 

6 mM Tris P H 7.5, 6 mM MgCl.,, 5 mM NaCl, 0.35 mg/ml BSA, 

7 mM mercaptoethanol, 0.1 mM ATP, 2 mM DTT, 1 mM 
spermidine and 400 units T4 DNA ligase (New England 
BioLabs) at 15° overnight. 10 ul of 0.5 M EDTA were 
added, the reaction was phenol extracted, ethanol 
precipitated, resuspended in a volume of 100 ul and 
layered on a 5 ml gradient of 5-20% potassium acetate in 
1 mM EDTA, 1 ug/ml ethidium bromide. The gradient was 
spun 3 hours at 50,000 rpm (SW55 rotor) and fractionated 
manually, collecting three approximately 0.5 ml fractions 
followed by six approximately 0.25 ml fractions in micro- • 
centrifuge tubes by means of a butterfly infusion set 
inserted just above the curve of .the tube. Linear 
polyacrylamide was added to 20 ug/ml, the tubes were 
filled with ethanol, chilled, spun, vortexed and respun 

25 as above. The precipitate was washed with 70% ethanol, 
dried, and resuspended in 10 ul. i ul of the last 6 
fractions was run on a gel to determine which fractions 
to pool, and material less than Ikb in size was typically 
discarded. Remaining fractions were pooled and ligated 

30 to the vector. 

. The complete sequence and derivation of the vector 
is shown in Figure 5. The vector was prepared for 



20 
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cloning by digestion with BstXI and fractionation on 
5-20% potassium acetate gradients as described for the 
cDNA. The appropriate band was collected by syringe 
under 3 00 nm UV light and ethanol precipitated as above. 

0 5 cDNA and vector were titrated in test ligations. Usually 
1-2 ug of purified vector were used for the cDNA from 4 
ug of poly A+ RNA. The ligation reactions were composed 
as described for the adaptor addition above. The 
ligation reactions were transformed into MC1061/p3 cells 

10 made competent as described above. The transformation 
efficiency for supercoiled vector was 3-5xl0 8 
colonies/ug. 

Recovery and characterization of the CD2 8 clone . Panning 
of the library was carried out as described herein above, 
15 using purified antibody 9.3 (DuPont) at a concentration 
of 1 ug/ml in the antibody cocktail. The methods used 
for COS cell transfection, radioimmunoprecipitation, RNA 
and DNA blot hybridization, and DNA sequencing were all 
as described herein above, 
20 To isolate the CD2 8 cDNA, a large plasmid cDNA 

library was constructed in a high efficiency expression 
vector containing an SV40 origin of replication. A 
preferred version of the vector, containing an M13 
origin, is shown in Figure 6. Three features of the 
25 vector make it particularly suitable for this use: (i) 
the eukaryotic transcription unit allows high level 
expression in COS cells of coding sequences placed under 
its control; (ii) The small size and particular arrange- 
ment of sequences in the plasmid permit high level 
30 replication in COS cells; and (iii) the presence of two 
identical Bst XI sites in inverted orientation and 
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separated by a short: replaceable fragment allows the use 
of an efficient oligonucleotide-based strategy to promote 
cDNA insertion in the vector. 

The BstXI cleavage site, CCAN^NTGG, creates a four 

05 base 3 1 extension which varies from site to site* A 
vector was created in which two identical sites were 
placed in inverted orientation with respect to each 
other, and separated by a short replaceable segment of 
DNAo Digestion with BstX I followed by removal of the 

10 replaceable segment yielded a vector molecule capable of 
ligating to fragments having the same ends as the 
replaceable segment r but not to itself • In parallel, 
cDNA synthetic oligonucleotides were employed that give 
the same termini as the replaceable segment. The cDNA 

15 then could not ligate to itself, but could ligate to the 
vector o In this way, both cDNA and vector were used as 
efficiently as possible <» 

Tailing with terminal transferase achieves the same 
end, but with less convenience and less overall 

2 0 efficiency . Moreover, homopolymer tracts located 5 9 to 

cDNA inserts have been reported to inhibit expression in 
vitro and invivo (Yokota, T., et al . r Nucl> Acids Res. 
14s 1511-1524 (1986); Riedel, H. , EMBO J, 3s 1477-1483 

(1985) ). Similar approaches based on the use of 

25 partially filled restriction sites to favor insertion of 
genomic DNAs (Zabarovsky, EoR. , et al . , Gene 42 : 119-123 

(1986) ) and cDNAs (Yang, Y*, et al . , Cell 47 :3-10 (1986)) 
recently have been reported. These approaches give 2 or 
3 base complementary termini , which usually ligate less 

3 0 efficiently than the 4 base extensions reported here. 

Although the cloning scheme of the. present invention 
does not result in a directional insertion of the cDNA, 
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the ability to make large libraries easily, coupled with 
a powerful selection procedure, mades directional 
insertion unnecessary- The library construction 
efficiencies observed according to the present invention, 
05 between 0.5 and 2xl0 6 recombinants per ug of hlRNA, with 
less than 1% background and an insert size greater than 
lkb, compared favorably with those described for phage 
vectors lambda gtlO (7.5 X 10 5 /ug of mRNA) and lambda 
gtll (1.5 X 10 6 /ug of mRNA) (Huynh, T. , et al . , , In : DNA 
10 Cloning Vol. I, A Practical Approach , Glover, D.M. (ed.) , 
IRL Press, Oxford (1985), pp. 49-78); but the resulting 
clones were more convenient to manipulate* 

Surface antigen CDNAs can be isolated from these 
libraries using the antibody enrichment method of the 
15 present invention. In this method, the library is 

introduced into COS cells (for example, by spheroplast or 
protoplast fusion) , where it replicates and expresses its 
inserts. The cells are harvested by detaching without 
trypsin, treated with monoclonal antibodies specific for 
2 0 the surface antigens desired, and distributed in dishes 
coated with affinity purified antibody to mouse 
immunoglobulins • Cells expressing surface antigen 
adhere, and the remaining cells can be washed away. From 
the adherent cells, a Hirt fraction is prepared (Hirt, 
25 B., J. Molec. Biol. 2JZz365-369 (1967)), and the resulting 
DNA transformed back into E. coli for further rounds of 
fusion and selection. Typically, after two rounds of 
selection with monoclonal antibodies recognizing 
different surface antigens, a single round of selection 
30 is performed with a single antibody, or pool of 
antibodies recognizing the same antigen. 
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Isolation of a CD28 cDNA . The CD28 cDNA was isolated 
from a library of about 3 X 10 5 recombinants prepared 
from cDNA from O.Sug of poly A + RNA using an earlier 
version of the protocol described in the Materials and 

05 Methods- The library was screened for CD28 (and other 
surface antigen) cDNA clones by the method outlined 
above. After the third transfection, COS cells were 
panned with the 9„3 antibody alone. A Hirt supernatant 
was prepared from the adherent cells and transformed into 

10 E. coli . Plasmid DNA was isolated form eight colonies 

and transfected individually into COS cell cultures. The 
presence of the DC28 antigen was detected in three of 
eight transfected cultures by indirect 

immunofluorescence. All three plasmid DNAs contained an 
15 insert of about 1-5 kbo 

cDNA sequence analysis « The CD28 cDNA encodes a long 
open reading frame of 220 residues having the typical 
features of an integral membrane protein (Figure 17) . 
Removal of a predicted (von Heijne, Nucl. Acids Res. 

20 14x4683-4690 (1986)) N-terminal signal sequence gives a 
mature protein of 202 residues comprising an 
extracellular domain with five potential N-linked 
glycosylation sites (Asn-X-Ser/Thr) , a 27-amino acid 
hydrophobic membrane spanning domain, and a 41-amino acid 

25 cytoplasmic domain- Comparison of the amino acid 

sequence of CD28 with the National Biomedical Research 
Foundation database (Version 10*0) revealed substantial 
homology with mouse and rabbit immunoglobulin heavy-chain 
variable regions over a domain spanning almost the entire 

30 extracellular portion of CD28o Within this domain two 
cysteine residues in the homology blocks Leu-(Ser or 



WO 89/08114 



PCT/US88/02809 



-71- 

Thr)-Cys and Tyr-(Tyr or Phe)-Cys are shared by CD28, 
CD4 , CD8 , immunoglobulin heavy- and light-chain variable 
sequences and rela-ted molecules with approximately the 
same spacing (Maddon et al . , Annu. Rev. Biochem. 
05 48:961-997 (1979)). 

CD2 8 cDNA directs the production of a homodimer in 
transfected COS cells . Immunoprecipitation of CD2 8 
antigen from transfected COS cells was carried out using 
the monoclonal antibody 9.3 (Hansen, J. A. , et al . , 
10 Iimnunogenetics lJD:247-260 (1980)). The material obtained 
from COS cells migrated with a molecular weight of 74kd 
under nonreducing conditions and 3 9 3cd under reducing 
conditions, a pattern consistent with homodimer 
formation. Under the same conditions activated T cells 
15 give bands with molecular weights of 87 and 4 4kd, and 
HPB— ALIj cells give bands of 92 and 50 led, under 
nonreducing and reducing conditions respectively. The 
variation in molecular weight of the material obtained 
from different cell types arises as a result of differing 

2 0 glycosylation patterns characteristic of each type. 

Similar results were observed with other leukocyte 
surface antigens (Seed et al , Proc. Natl. Acad. Sci USA 
87 (1987)). The nucleotide sequence of the CD28 cDNA 
predicts a mature protein with molecule weight of 23 kd, 
25 much smaller than observed in these experiments, and 

probably attributable to utilization of the 5 N-linked 
glycosylation sites predicted by the amino acid sequence. 

RNA blot analysis . Equal amounts of total RNA prepared 
from cell types expressing or lacking CS28 were subjected 

3 0 to RNA blot analysis as described hereinabove. Four 
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bands with molecular weights of 3.7, 3.5, 1.5, and 1.3 kb 
were visible in lanes containing RNA thymocytes, T 
blasts, senescent T cells, and the T cell leukemia cell 
lines PEER and HPB-ALL. No bands were detected in lanes 
containing RNA prepared from the cell lines U937 
(histiocytic leukemia) , HuT-78 (Adult T cell leukemia) , 
Jurkat (T cell leukemia) , Namalwa (Burkitt lymphoma) , 
MOLT4, and HSB-2, all of which do not express CD28. The 
1-5 kb transcript presumably corresponds to the isolated 
cDNA, and the 3.7 and 3.5 kb species reflect incomplete 
splicing or alternative polyadenylation site utilization. 
The 1.3 kb transcript may terminate at an unconventional 
polyadenylation signal, since there is no obvious 
candidate in the sequence. 

15 The CD28 gene is not rearranged . DNA blot analysis (Seed 
et al ° ' Proc. Na tl. Acad. Sci USA 87 (1987) ) of genomic 
DNA from placenta, peripheral blood lymphocytes, T cells, 
HeLa cells, or the tumor lines used in the RNA blot 
analysis above showed identical Dra 1 digest patterns 
indicating that rearrangement is not involved in the 
normal expression of the CD28 gene during development. 
Similarly, no gross genomic rearrangement underlies the 
failure of the examined T-cell tumor lines to express 
CD28 antigen. It may be inferred from the Dra 1 fragment 
pattern that the CD28 gene contains at least two introns. 

EXAMPLE IV Isolation and Molecular Cloning of Two 

Human CD7 Antigen cDNAs 



20 



25 



The CD7 cluster of antibodies (Palker, et al. , 
Leukocyte Typing II. Springer-verlag, New York, 303-313 
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(1985)) recognized a 40 kd glycoprotein (gp40) on the 
surface of peripheral blood T cells and thymocytes. 
Early studies with anti -CD7 antibodies showed that CD7 + T 
cells enhance immunoglobulin (Ig) synthesis by B cells 
05 (Miroshima et al. , J. Immunol . 129 : 1091-1098 1982)), 
suppress B cell Ig synthesis when stimulated with 
Concanavalin A (Haynes et al . Proc. Natl. Acad. Sci. 
U.S.A. 2^ :5a29 ~ 5833 (1979)) and are the precursors of the 
cytotoxic T cells generated in mixed lymphocytic culture 
10 (Morishima et al . , J . Immunol . 129 : 1091-1098 (1982)). 

Furthermore, CD 7 has been found to be the most reliable 
marker for the identification of T cell acute 
lymphoblastic leukemia (Link et al. , Blood 62 : 722-728 
(1983)). As such, studies have been carried out, in 
15 which cytotoxins coupled to the anti-CD7 antibody 3A1 
were used to purge bone marrow prior to reinfusion to 
avoid early relapse in autologous bone marrow transplants 
or as prophylaxis against graft vs. host disease in 
allogenic bone marrow transplants (Ramakrishnan et al . , 
20 Jo Immuol . 135 : 3616-3622 (1985)). Similarly, anti-CD7 

antibodies also show promise as Immunosuppressive agents 
in the treatment of allograft rejections (Raftery et al . , 
Transpl. Proc , 17:2737-2739 (1985)) which is in accord 
with the recent observation that the anti-CD7 antibody 
25 7G5 significantly inhibits the primary mixed lymphocyte 
reaction (Lazarovits et al. Leukocyte Typing III. 
Oxford Univ. Press, Oxford (1987)). 

At present the physiological role of CD7 is not 
understood. It is known that anti-CD7 antibodies are not 
30 mitogenic, and do not block the T cells' response to PHA, 
or tetanus toxoid (Palker et al. Leukocyte Typing . 
Springer-verlag, New York, 303-313 (1985))'. Some have 
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noted that expression of CD7 in thymocytes occurs prior 
to the onset of T cell receptor beta-chain rearrangement 
(Pittaluga et al. f - Blood , 68:134-139 (1986)) and have 
pointed to a possible role for CD7 in this rearrangement 

05 and subsequent expression of the T cell receptor. It is 
clear that the cloning of the CD7 antigen would further 
efforts to understand its role in T cell physiology o 
Nucleotide sequencing and preliminary characterization of 
two cDNAs encoding the CD7 antigen was carried out 

10 according to the method of the present invention- 

Prompted by the recent suggestion that CD7 may be, or be 
part of , the T cell IgM receptor (Sandrin et al- , 
Leukocyte Typing III. Oxford Univ. Press, Oxford (1987) ) , 
the ability of COS cells expressing CD7 to bind IgM or 

15 IgM immune complexes was evaluated. The results do not 
support the simple notion that CD7 itself is an IgM 
receptor o 

Preparation of cDNA library and recovery and 
characterization of CD7 clones 

20 Preparation of an HPB-ALL cDNA library in the 

expression vector piH3 was carried out as described 
herein. Panning of the library was carried out according 
to the method of the present invention, using purified 
anti-CD7 antibody Leu9 (Becton Dickinson) and antibody 

25 7G5 as ascites fluid was diluted 1/1000, Methods for 

cell trans f ect ion f radio immunoprecipi tat ion, DNA and RKA 
blot hybridization and DNA sequencing were all as 
described herein- 



30 



IgM and IqG binding by COS cells transfected with CD7 and 
CDW3 2 

Human Igm, IgG, and IgA antibodies, affinity 
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purified FITC conjugated goat anti-human immunoglobulins 
antibodies (anti-Ig (G+M+A) ) , washed and preserved bovine 
red blood cells, and IgG and IgM fractions of rabbit 
anti-bovine red blood cell antibodies were purchased from 
05 Cooper Biomedical (Malverne, PA) . COS cells were 
transfected by the DEAE Dextran method with cDNAs 
encoding the CD7 , CDw3 2, and CD2 8 surface antigens. 4 8 
hours after transfection the cells were washed with 
PBS/ 0.5% BSA and incubated with either human IgM, IgG or 
10 IgA antibodies at a concentration of lug/ml, at 4*C for 2 
hours. Subsequently the cells were washed with PBS/0.5% 
BSA and incubated for 30 minutes at 4°C with FITC 
conjugated rabbit anti-human immunoglobulins. After 
washing the cells were examined with a fluorescence 
15 microscope. The experiments were also performed in the 
presence of 0.1% azide with the same results. 

Bovine erythrocytes for rosette assays were prepared 
as described by Ercolani et al. , J. Immunol . 
127 :2044-2051 (1981). Briefly, a 2% suspension of bovine 
20 erythrocytes was washed with PBS/0.5% BSA and treated 
with subagglutinating amounts of either IgG or the IgM 
fraction of rabbit anti-bovine erythrocyte antibodies at 
4°C for 1 hour. Erythrocytes were then washed twice with 
PBS/0.5% BSA and adjusted to a 2% solution. 2 ml of 
25 antibody-coated erythrocytes were layered on 6 0mm dishes 
containing COS cells which had been transfected 48 hours 
earlier with either CD7, CD32 or CD28 by the DEAE Dextran 
method. The dishes were then centrifuged at 150 X g at 
4°C for 15 minutes. After an additional 45 minute 
3 0 incubation at 4°C, the plates were gently washed 5 times 
with 5 mis of PBS/0.5% BSA, and the COS cells were 
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examined for rosette formation. These experiments were 
also performed in the presence of 0.1% sodium azide 
without alteration of the results. 

Formation of T cell rosettes with antibody-coated 

05 e r yt hr o c y t e s 

Peripheral blood lymphocytes were obtained from 
heparinized blood by centrifugation at 4'C over a 
Ficoll-Hypague gradient at 400 x g for 3 0 minutes. 
Leukocytes at the interface were washed two times with 

10 PBS- The leukocytes were adjusted to 10Y7 cells/ml in 

IMDM/10% Fetal Bovine Serum (FBS) and incubated in tissue 
culture dishes at 37 °C for 3 0 minutes. Nonadherent cells 
were transferred to new dishes, and PHA was added to 
stimulate proliferation of T lymphocytes. On the next 

15 day the cells were washed with PBS and placed in fresh 
IMDM/10%FBSo 

Rosette assays were performed three days later. 
Cells were washed with PBS/0 * 5% BSA, and a 10 ul 
suspension of 2% Ig-coated erythrocytes prepared as 

20 described above was added to 10 ul of PBS/0.5% BSA 

containing 5 X 10 6 cells/ml. The mixtures were placed in 
Falcon round bottom 9 6 well plates and centrifuged at 15.0 
X g for 15 min at 4°C. After an additional incubation of 
45 min at 4°C pellets were resuspended with 10 ul of 

25 PBS/0 .5% BSA, and the rosettes scored by phase contrast 
microscopy. The experiments were carried out in both the 
presence and absence of 0.1% sodium azide with no 
detectable difference. 

Isolation of cDNAs encoding the human CD7 antigen . 
3 0 To isolate CD7 cDNAs, a large plasmid library was 

constructed in the expression vector H3M as describe 
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hereinabove. The library was introduced into COS cells 
by spheroplast fusion, and allowed to replicate and 
express its inserts. The COS cells were harvested by 
detaching without trypsin 4 8 to 7 2 hours after 
05 transfection, treated with monoclonal antibodies specific 
for surface antigens believed to be encoded in the 
library, and distributed in dishes coated with affinity 
purified anti-mouse antibody as described herein. Under 
these conditions, cells expressing surface antigen adhere 
10 and the remaining cells can be washed way. 

A Hirt (Hirt, J. Mol. Biol ., 2^:365-369 (1967)) 
fraction was prepared from adherent cells, and the 
resulting DNA transformed back into E. coli for further 
rounds of fusion and selection. In the third round of 
15 selection the detached cells were treated with a mixture 
of monoclonal antibodies specific for CD7 (765 and Leu9) , 
and a Hirt supernatant was again generated and 
transformed into E. coli . After transformation of the 
DNA into E. coli 8 colonies were picked, and the plasmid 
2 0 DNA prepared from them by an alkaline miniprep procedure 
(Maniatis, et al. , Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Press, Cold Spring Harbor, New 
York (1982)). DNA was prepared from 8 resulting colonies 
and transfected into COS cells. After 3 days, surface 
25 expression of the CD7 antigen was detected by indirect 
immunofluorescence in 7 of 8 transfected dishes. 
Restriction enzyme digest of the corresponding plasmid 
DNAs revealed two species. One contained a 1 . 2 kb 
insert, and the other a 1.3 kb insert. 
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CD7 cDNA sequence analysis 

Both, isolates were sequenced by the 
dideoxynucleotide method. The 1-2 kb cDNA encodes a long 
open reading frame of 240 residues having the* typical 

OS features of an integral membrane protein* The initial 
assignment of the signal sequence cleavage site by the 
method of von Heijne ( Nucl, Acids Res, 14:4683-4690 
(1986) ) was at the 18th residue. It later was 
determined, however, that the homology with 

10 immunoglobulin variable regions would better predict the 
mature terminus at residue 26; this assignment would also 
correlate well with the position of the intron as 
discussed below and as shown in Figure 8. Removal of the 
predicted N-terrainal signal sequence gives a mature 

15 protein of 215 residues with a predicted molecular mass 
of 23kdo In the extracellular domain are two N-linked 
glycosylation sites (Asn-X-Ser Thr) , in agreement with 
the results of Sutherland et al. ( Jo Immunol, 133 :327-333 
(1984)), who also showed the presence of O-linked glycans 

20 and covalently associated palmitic acid on the mature 
protein o In the 27 amino acid hydrophobic membrane 
spanning domain is a single cysteine residue which may be 
the site of fatty acylation (Rose et al. , Proc. Natl. 
Acad. Sci. USA 81:2050-2054 (1984); Kaufman et al . , J. 

25 Biol, Chem. 259 :7230-7238 (1984)). The length of the 

cytoplasmic domain, 3 9 residues, is in good agreement with 
the 3 0-40 amino acids predicted by protease digestion of 
the CD7 precursor in rough microsomal membrane fractions 
(Sutherland et al. , J. Immunol. 133 :327-333 (1984)). 

30 Sequence analysis of the 1.7 kb clone (Figure 8) 

revealed the presence of an intron located 121 bp from 
the 5 1 end. The 411 bp intron contains stop codons in 



WO 89/08114 



PCT/US88/02809 



all three reading frames and is located just downstream 
of the secretory signal sequence, as is frequently 
observed for secreted or surface proteins. Both the 5' 
and 3 ■ ends of the intron conform to the splice 
0 5 donor/acceptor consensus AAG GTRAGA/ . . ./Y- NYAG A 

6 — 11 

(Mount, Nucl. Acids Res. 10 :459-472 (1982)). Because 
both the 1.2 and 1.7 kb clones express CD7 antigen 
equally well in COS cells, the intron must be excised in 
COS cells fairly efficiently, 
10 Comparison of the amino acid sequence with the 

National Biomedical Research Foundation database revealed 
substantial homology with human and mouse immunoglobulin 
kappa chain and T-cell receptor gamma chain variable 
regions over almost the entire extracellular portion of 
15 the molecule. Two cysteine residues shared in 

approximately equal spacing by all three structures fall 
in the conserved sequences Ile-Thr-Cys and Tyr-X-Cys. In 
kappa chain variable regions these cysteins form a 
disulfide bridge. The presence of at least one 
2 0 intrastrand disulfide bond in the CD7 structure has 
previously been proposed by Sutherland et al. ( J. 
Immunol. 133 :327-333 (1984)), who noted that 
inununoprecipitation of CD7 gave rise to a band with an 
apparent molecular mass of 4 0 kd under reducing 

2 5 conditions and 3 8 kd under nonreducing conditions. 

Based on the homology with immunoglobulin V-regions, 
it is predicted that CD7 contains a disulfide bond 
linking Cys 23 and Cys 89. A second disulfide bond, 
linking Cys 10 and Cys 117, has been proposed, based on 

3 0 the structural similarity between CD7 and Thy-1. The 

extracellular domains of both Thy-1 and CD7 have 4 
cysteine residues, in roughly homologous positions. The 
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4 cysteine residues, in roughly homologous positions. 
The 4 cysteine residues of Thy-1 are joined in two 
internal disulfide' bridges between Cys 9-111 and Cys 
19-85 (Williams et al - , Science 216 : 696-703 (1982) )» In 

05 Thy-1, Cys 111 forms an amide bond with the ethanolamine 
moiety of a substituted phosphatidyl inositol, and is thus 
the last residue of the mature molecule (Tse et al . , 
Science 23£s 1003-1008 (985) ) o In CD7, Cys 117 is 
followed by four repeats of a sequence whose consensus is 

10 Xaa-Pro-Pro-Xaa-Ala-Ser-Ala-Leu-Pro, and which, it is 

proposed, plays the role of a stalk projecting the V-like 
domain away from the surface of the cell. 

In addition to the homologies shown in Figure 20 and 
mentioned above, the extracellular domain of CD7 has 

15 significant homology with both chains of the rat CD8 

heterodimer (Johnson et aL , Nature 323 :74-76 (1986)) , 
and the myelin P Q protein (Lemke et al. , Cell 40:501-508 
(1985) )o 

CD7 directs the production of a 4 0 kd protein in 

20 trans feet ed COS cells 

Immunoprecipitation of CD7 antigen from transfected 
COS cells was carried out as described herein using 
monoclonal antibody 7G5 (Lazarovits et al . , Leukocyte 
Typing- III , Oxford Univ<, Press, publisher, Oxford, 

25 England (1987) • The material obtained from COS cells 

migrated with as a broad band with molecular weight of 4 0 
kd under reducing conditions. Under the same conditions 
HPB-AIiL cells (the cDNA donor line) and activated T cells 
gave bands with molecular widths of 41 and 3 9 kd 

3 0 respectively o In both the COS cell and HPB-ALI, lane a 
faint band with molecular weight of 3 0 kd was also 
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observed, possibly corresponding to a partially 
glycosylated precursor (Sutherland, D.R., et al . , J. 
Immunol . 133 :327-333 (1984)). 

RNA blot analysis 

05 Equal amounts of total RNA prepared from cell types 

expressing or lacking CD7 were subjected to Northern blot 
analysis as described herein. A single 1.3 kb species 
was visible in lanes containing RNA from thymocytes, 
activated T cells, resting T cells, and the T cell 

10 leukemia lines HuT-78, HPB-ALL, Jurkat J3R7, HSB-2 and 

PEER. With the exception of the PEER cell line, none of 
the T cell tumors showed significant overexpression of 
CD7 transcripts. CD7 RNA was detected in all of the 
thymus-derived cells, but not in RNA from U937 

15 (histiocytic leukemia) and Namalwa (Burkitt Lymphoma) 

cellso No band corresponding to the 1.7 kb cDNA could be 
detected, suggesting that this species is artificially 
enriched during the cloning or library amplification 
process, 

2 0 Enrichment during amplification seems unlikely 

because the 12 kb cDNA clone propagates as well in E. 
coli as the 1.7 kb clone. However, immediately upstream 
and downstream from the site of insertion of the intron 
are sequences that could form an interrupted stem and 
25 loop structure. Eight of the 10 basepairs of the 

potential stem are GC pairs, perhaps giving the structure 
sufficient stability to interfere with elongation of the 
cDNA first strand. The presence of the intron greatly 
separates the two halves of the stem, potentially 

3 0 eliminating the structure via unfavorable loop entropy 

and allowing efficient first strand synthesis. 
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The CD7 c^ene is not rearranged 

Southern blot analysis of genomic DNA from placenta, 
peripheral blood lymphocytes, T cells, HeUa cells, or the 
tumor lines used in the RNA blot analysis aboVe showed 
05 identical Dra 1 digest patterns. Thus, the CD7 gene is 
not grossly altered during development, and the high 
level of expression in the PEER cell line is not the 
consequence of a substantial genomic rearrangement. 

cos cells expressing- CD7 do not bind IgM 

10 Human peripheral blood T lymphocytes express 

receptors for IgM antibodies (FcRu: Moretta et al . , Eur. 
J« Immunol. _5:565-569 (1975); McConnell et al. # Immunol . 
30:835-837 (1976)), Recently it has been reported that 
CD7 might play a role in IgM binding by T cells (Sandrin 

15 et al <, . Leukocyte Typing III , Oxford Univ. Press, 

publisher, Oxford, England (1987) )o L cells, normally 
CD7~ and FcRu", become CD7 + and FcRu + when trans feet ed 
with a 16 kb genomic fragment encoding the CD7 antigen 
(Sandrin et al, , Leukocyte Typing III , Oxford Univ. 

20 Press, publisher, Oxford, England (1987)), Furthermore, 
IgM binding to CD7-positive cells can be blocked by the 
anti-CD7 monoclonal antibody Huly-m2 (Thurlow et al, , 
Transplantation 38_s 143-147 (1984)), and IgM columns bind 
a 37 kd protein from radiolabeled ly sates of peripheral 

25 blood T lymphocytes (Sandrin et al , , Leukocyte Typing 
III, Oxford Univo Press, publisher, Oxford, England 
(1987) ) . 

Accordingly, COS cells expressing CD7 were tested 
for their ability to bind IgM. IgM receptor activity was 
30 assayed either by direct binding (Hardin et aL , Proc, 
Natl. Acad, Sci. USA 76:912-914 (1979)) or by a rosette 
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assay with ox erythrocytes coated with an IgM fraction of 
rabbit anti-bovine red cell serum as described by 
Ercolani et al. J. Immunol. 127:2044-2051 (1981)). Cells 
expressing CD7 neither bound human IgM nor formed 
rosettes with Igm-coated erythrocytes. Under the same 
conditions, COS cells transfected with a cDNA encoding 
the human IgG receptor CDw32 bound IgG directly and 
formed rosettes with IgG-coated erythrocytes. 
Erythrocytes coated with IgM or IgG antibodies also 
adhered to a fraction of peripheral blood lymphocytes as 
reported (Moretta et al. , Eur. J. Immunol. 5:565-569 
(1975) ) . 

These results do not support the notion that the CD 7 
antigen is by itself an IgM receptor, although they do 
15 not exclude the possibility that COS cells suppress IgM 
binding activity in some manner, or that CD7 is part of, 
or modified to become, an IgM receptor. That CD7 is not 
by itself an IgM receptor is supported by the observation 
that a number of CD7 + T cell lines are FcRu- (Sandrin et 

Leucocyt e Typing III . Oxford Univ. Press, publisher, 
Oxford, England (1987))'. 

EXAMPLE V Isolation and Molecular Cloning of the 

Human CDw32 Antigen 



20 



25 



30 



A cDNA encoding the human CDw3 2 antigen, a human 
receptor for immunoglobulin G constant domains (Fc 
receptor) , was isolated by the method of the present 
invention, by virtue of its affinity for its ligand, IgG. 
The sequence of the isolated clone is most closely 
related to the murine beta 2 Fc receptor, but has 
diverged completely in the portion encoding the 
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cytoplasmic domain. The receptor expressed in COS cells 
shows a preference for lgG 1 among IgG subtypes, and no 
affinity for IgM, -IgA or IgE. 

To isolate the Fc receptor clone, cDNA libraries 
05 were prepared from tumor cell lines or from a human tumor 
and transfected into COS cells. After 48 hours, the 
cells were treated with mouse or human IgG antibodies, 
and allowed to settle on dishes coated with 
affinity-purified sheep anti-mouse IgG or goat anti-human 
10 . IgG antibodies « After lysis, DNA recovery, and 

transformation in E ~ coli , the cycle was repeated for two 
more rounds. Although no positive clones were isolated 
from the tumor line libraries, a cDNA clone encoding an 
Fc receptor was isolated from a library prepared from a 
15 human adrenal tumor „ It has been discovered that many 
tumors are heavily infiltrated by macrophages and 
lymphocytes o Thus, tumor RNA may be a productive source 
in general for transcripts of human macrophages. 

By indirect immunofluorescence assay, the human 
20 receptor expressed on COS cells bound all mouse and human 
IgGs with relatively low affinity -10~ 7 M) , and a clear 
discrimination was noted among human antibodies for IgG^. 
Human IgM, IgA x IgA 2 , and IgE did not bind, nor did 
murine IgM or IgA. As expected, human Fc, but not Fab 
25 fragments, bound to the transfected cells. Among 

monoclonal antibodies donated to the Third International 
Workshop on Leukocyte Differentiation Antigens, three 
gave strong positive immunofluorescence: two (out of 
two) recognizing the Fc Receptor CDw3 2 determinant, and 
3 0 one (out of four) recognizing the CD23 (B cell IgE Fc 
receptor) determinant. Monoclonals recognizing the T 
c^ll/Macrophage Fc receptor antigen CD16 gave only weak 
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immunofluorescence comparable to that shown by control 
ascites . 

Radioimmunoprecipitation of transfected COS cells 
with CDw32 antibodies showed the presence of a single 

05 4 0kd species, comparable in size to the antigen 

recognized on the surface of the myeloid CDw32 + line 
HL-6 0, and to the less abundant antigen present on the 
histiocytic leukemia line U937. This result reinforces 
the notion that the isolated receptor is CDw3 2 , as the 

10 CD16 receptor is reported to be substantially larger 
(60-70kd). 

The nucleotide sequence of the isolated receptor 
(Figure 9) is highly homologous to that of members of the 
recently isolated murine receptor family, and most 
15 closely related to the murine beta 2 receptor by nucleic 
acid homology. Surprisingly, the murine beta 2 receptor 
is found on T and B lymphocytes and macrophages, while 
the alpha receptor is restricted to macrophages; in the 
human system, CDw3 2 (shown here to be beta 2 ~like) is 
20 restricted to macrophages while another Fc receptor 
(CD16) is found on lymphocytes and macrophages. The 
human sequence appears to have diverged from the mouse 
sequence by insertion of approximately lkb of DNA a few 
bases 3" to the junction between the transmembrane and 
25 cytoplasmic domains. The junctions of the insertion site 
do not show obvious relationships to splice donor and 
acceptor sequences. Comparison of the human and murine 
peptide sequences showed that the peptide sequence 
diverges at the end of the transmembrane domain, before 
3 0 the nucleotide sequence diverges, suggesting the 

existence of a selective pressure favoring the creation 
of a differenct cytoplasmic domain. 
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RNA blot analysis showed that myeloid but not 
lymphocytic cell lines expressed RNA homologous to the 
CDw32 probe. DNA blot analysis showed multiple bands 
consistent with the existence of a small multigene 
05 family, 

EXAMPLE VT Isolation and Molecular Cloning of Two 

cDNA Clones Encoding the B Lymphocyte- 
specific CD20 (Bl, Bp3 5) Antigen 



10 



Recent studies suggest that the pan B cell antigen 
CD20 (Bl, Bp35) plays an important role in B cell 
activation * Monoclonal antibodies (mAb) to CD2 0 induce 
different cellular responses depending on the antibody 
used and the stage of differentiation or activation of 
the target B cells* The monoclonal antibody 1F5 
15 activates resting B cells by initiating the transition 
from the G Q to the phase of the cell cycle, and 
induces dense tonsillar B cells to proliferate (Clark et 
al^, Proc Natlo Acad. Sci USA 82S1766 (1985); Clark and 
Shu, Jo Immunol o 138 :720 (1987)). However, 1F5 does not 
20 induce an increase in cytoplasmic free calcium and does 
not induce circulating B cells to proliferate 
(Rabinovitch et al. , In: Leukocyte Typing III 
(McMichael, Ed,) r p. 435, Oxford University Press 
(1987))/ Other anti-Cd20 mAbs, such as Bl, have been 
2 5 shown to block B cell activation (Tedder et al. , J. 

Immunol. 13 5:973 (1985)) and both 1F5 ahd Bl can inhibit 
B cell differentiation (Golay et al. , J , Immunol. 
135:3795 (1985) ) „ Recently it has been suggested that 
phosphorylation and internalization of CD2 0 may be 
necessary steps for B cell entry into the a phase of the 
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cell cycle (Valentine et al. , In: Leukocyte Typing- III 
(McMichael, Ed.), p. 440, Oxford University Press 
(1987)). In the present example, two CD20 cDNA clones 
were isolated and expressed using the methods 'of the 
0 5 present invention. 
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15 



Preparation of cDNA Library and Recovery of cDNA Clones 
by Panning 

Poly (A) + RNA was prepared from the human Burkitt 
cell line Daudi by oli go (dT) cellulose chromatography of 
total RNA isolated by procedures described herein. cDNA 
preparation and expression library construction were 
carried out as described. 

Anti CD20 mAbs 1F5 , 2H7 , Bl, L27, G28-2, 93-1B3, 
B-Cl, and NU-B2 were obtained from the International 
Leukocyte Typing Workshop (Valentine et al. , In: 
Leukocyte Typing III (McMichael, Ed.), p. 44 0, Oxford 
University Press (1987)). Purified mAbs were used at a 
concentration of lug/ml and ascites were used at a 
dilution of 1:1000. Panning was done according to the 
20 present method. In the first round of screening, eight 
10cm dishes of 50% confluent COS cells were transfected 
by the DEAE-Dextran method. Subsequent screening cycles 
were performed by spheroplast fusion. 

Immuno precipitation , Sequencing. RNA and DNA Blot 
25 Hybridization 

B cell lines CESS and Daudi were metabolically 
labeled with 35 S-methionine and 35 S-cysteine for 6h at 
37 «C. COS cells transfected by the DEAE-Dextran method 
were similarly labeled 36 hours post-transfection . The 
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labeled cells were incubated with the Bl mAb (Coulter) at 
4°C for Ih, washed in PBS, and lysed with 0.5% NP-40, 
0.1% SDS, 0.05% debxycholate and ImM PMSF in PBS . After 
centrifuging (13000xg, 5 minO, the lysate was incubated 

05 with fixed S . aureus cells (Calbiochem) for 1 hr at 4°C. 
The S o aureus cells were pelleted , washed 5 times with 
l%NP-40/PBS, eluted and electrophoresed through 12-5% 
polyacrylamide gels. 

DNA and RNA blot analysis and hybridization probe 

10 preparation were carried out as described. Sequencing 

was done by the method of Sanger et al. Proc, Natl. Acad. 
Scio USA 74:5463 (1977)). The nucleotide sequence of the 
CD20»4 cDNA is represented in Figure 10. 

Two cDNA clones, bearing inserts of 1.5 (CD2 0.4) and 

15 1.0 kb (CD20.6), were isolated from a Daudi cell DNA 
library by panning with a panel of mAbs against CD20. 
COS cells transfected with either clone reacted with all 
members of the panel of antibodies. Immunoprecipitation 
of the cDNA-encloded protein from transfected COS cells 

20 showed two bands of 32 and 30kd reminiscent of the 37 and 
35kd bands observed in different B cell subsets and lines 
(Valentine et al- , "Structure and Function of the B Cell 
Specific 35-37kDa CD20 Protein," In: Leukocyte Typing 
III, A. McMichael et al . , eds. , Oxford University Press, 

25 p. 440 (1987) ) 0 It has been the experience of the 

present inventors that the molecular masses of surface 
antigens expressed in COS cells are consistently smaller 
than those of their native counterparts. This may be due 
to differences in glycosylation. 

30 ' Both cDNA clones have the same coding sequence, and 

differ only in the 3' untranslated region. The insert in 
clone CD20.6 has a short polyA tail and lacks a consensus 
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polyadenylation signal, while the insert in CD20.4 lacks 
a polyA tail and extends 4 31bp beyond the 3« terminus in 
CD20. 6 (Fig. 10A) .' 

RNA blot analysis showed that three transcripts of 
0 5 3.8, 3.0 and 1.5 Jcb were present in B cells but absent 
from other cell types, in agreement with the known 
pattern of antibody reactivity (Clark et al . , Proc. Natl. 
Acad. Sci. USA 82:1766 (1985); Clark et al , J. Immunol. 
138:720 (1987); Tedder et al . , J. Immunol. 135:973 
10 (1985); Golay et al. , J. Immunol. 135 :3795 (1985)). It 
appears likely that the Cd20.6 clone is derived from the 
1.5 kb transcript or possibly from an even shorter, 
undetectable species. Because the CD2 0.4 clone lacks a 
poly (A) tail, its source cannot be inferred at present. 
!5 DNA blot analysis showed that the CD2 0 genomic 

sequences are not rearranged during development and are 
not amplified in the cell lines examined. A restriction 
fragment length polymorphism was observed in a DNA sample 
obtained from placenta. 
20 . The amino acid sequence predicted by the cDNA 

contains 297 residues and has a molecular mass of 33,097 
daltons. The sequence contains three major hydrophobic 
stretches involving residues 51-103, 117-141 and 183-203 
(Fig. 10). Two other notable characteristics are the 
25 absence of an amino-terminal signal peptide and the 

presence of a highly charged carboxy-terminal domain. A 
polyclonal anti-CD2 0 antibody that recognized the last 18 
residues of the carboxy-terminus reacts with lysates of 
cells expressing CD20 but not with intact cells, 
30 suggesting that the CD2 0 carboxy terminus is located 

within the cytoplasm. Since there is no amino-terminal 
signal peptide, it is likely that the amino-terminus is 
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also intracellular, and. that the first hydrophobic region 
acts as an internal membrane insertion signal (Zerial et 
a3 -- . EMBO J. 5:1543 (1986)). The first hydrophobic 
region is composed of 53 residues and is therefore long 
05 enough to span the membrane twice if organized, as an 

alpha helix. Because there are two remaining hydrophobic 
regions, the intracellular side. Alternatively, the 
carboxy-terminus requires that the first hydrophobic 
domain exit the membrane on the intracellular side. 
Alternatively, the carboxy-terminal antibody may only 
recognize epitopes exposed by detergent treatment 
allowing the carboxy-terminus to be extracellular and 
forcing the first hydrophobic domain to exit the membrane 
on the extracellular side. The sequence contains 2 
15 potential N-glycosylation sites (Asn-Xaa-Ser/Thr , where 
Xaa cannot be Pro (Bause, Biochem. J. 209:331 (1983)) at 
positions 9 and 293, but neither of these is expected to 
be used if located in intracellular domains of the 
molecule. The difference in molecular mass between CD20 
20 expressed on COS cells and on B cells is therefore 

presumably due to 0-linked glycosylation, although other 
forms of post-translational modification are not 
excluded. If the carboxy-terminus is intracellular, the 
only extracellular domain would lie between residues 142 
25 and 182. This region is rich in serine and threonine 
residues which might support 0-glycosylation. 

The observation of two protein species in COS cells 
cannot be explained by alternate splice formation because 
the cDNA sequence does not contain any promising splice 
donor or acceptor sequences (Shapiro et al. , Nucl. Acids 
Res^ 15:7155 (1987)). A difference in glycosylation or 
alternate translational initiation site selection may 
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account for the two species observed. Initiation at 
either the first or the second ATG gives protein 
molecular masses of 33.1 and 30.8 kd respectively, in 
good agreement with the sizes observed in COS cells. 
0 5 Neither ATG is embedded in the consensus sequence 

proposed by Kozak ( Nucl. Acids Res. 12:857 (1984)). Use 
of alternate initiation sites has been reported for 
several proteins (Kozak, Nucl. Acids Res. 12 :857 (1984)). 
Comparison of the peptide sequence with the 
10 sequences in the National Biomedical Research Foundation 
database showed no significant homology by the FAS TP 
rapid sequence alignment algorithm. Because the bulk of 
the protein appears to be confined to the interior of the 
membrane and the cell, it seems plausible that it may 
15 play a role in transducing signals from other 

transmembrane proteins to the cell interior. Consistent 
with this role is the relatively hydrophilic nature of 
the hydrophobic regions which might allow hydrogen bond 
interactions with the transmembrane portions of other 
20 proteins. 

EXAMPLE VII Isolation and Molecular Cloning of I CAM, 

An Adhesion Liqand of LFA-1 

Antigen-specific cell contacts in the immune system 
are strengthened by antigen-non-specific interactions 

25 mediated in part by lymphocyte function associated or LFA 
antigens (Springer, T.A. , et al. , Annu. Rev. Immunol. 
5:223-252 (1987); Anderson, D.C., et al. , Annu. Rev. 
Medicine 5:175-194 (1987)). The LFA— 1 antigen, a major 
receptor of T cells, B cells and granulocytes (Rothlein, 

50 R., et al. , Exp. Med. 163 :1132-1149 (1987)), is involved 
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in cytolytic conjugate formation, antibody -dependent 
killing by NK cells and granulocytes, and helper T cell 
interactions. LFA-1 has been placed in the integrin 
family of cell surface receptors by virtue of the high 
05 sequence similarity between the LFA-1 and integrin beta 
chains (Kishimoto , T.K., et al . , Cell 48 : 681-690 (1987); 
Hynes, R.o. Cell 48:549-554 (1987) ). The adhesion 
ligands of the integrin family are glycoproteins bearing 
the Arg-Gly-Asp (RGD) sequence motif, e.g-, fibronectin, 
10 fibrinogen, vitronectin and von Willebrand factor 
(Ruoslahti, E. , et al. , Cell 44 :517-518 (1987))o 
In this example, the Intercellular Adhesion 
Molecule-1 (ICAM-1) , a ligand for LFA-l (Rothlein, R. , et 
al> , J- Immunol o 137 :1270-1275 (1986)? Dustin, M.L., et 
15 al » J« Immunol o 137 :2:45-254 (1986)), was cloned according 
to the methods of the present invention- ICAM contains 
no RGD motifs, and instead is homologous to the neural 
cell adhesion molecule NCAM (Cunningham, B.A. r et al. 
Science 236:799-806 (1987); Barthels, Do , et al. , EMBO J. 
20 6:907-914 (1987)), COS cells transfected with the ICAM 
cDNA clone bind myeloid cells by a specific interaction 
which can be blocked by monoclonal antibodies directed 
against either LFA-l'or ICAM-1- 

A cDNA library was constructed using RNA prepared 
25 from H]L60 cells induced with phorbol myristyl acetate 
(PMA) . The library was transfected into COS cells and 
cells expressing surface antigens were recovered 
according to the methods of the present invention by 
panning with the anti-ICAM monoclonal antibodies (mAbs) 
30 8F5 and 84H10 (McMichael, A.J. , et al. , eds., Leukocyte 
Typincr III. White Cell Differentiation Antigens , Oxford 
University Press (1987)). Episomal DNA was recovered 
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from the panned cells and the expression-panning cycle 
repeated a further 2 times to obtain a cDNA clone 
designated p ICAM-1 . 

COS cells transfected with pi CAM— 1 gave positive 
0 5 surface immunofluorescence reactions with three 
anti-ICAM-1 antibodies: 8F5 ; 84H10; and RR-1. 
Immunoprecipitation of pICAM-l-transfected COS cells with 
the mAb 84H10 gave a band of molecular mass lOOkD. 30) . 
A slightly larger protein of 110 kD was precipitated from 
10 HL60 cells induced for 48 hours with either phorbol 
myristyl acetate (PMA) , gamma-interf eron (gammalFN) , 
tumour necrosis factor (TNF) , or interleukin-1 beta (IL-1 
beta) , but was absent from uninduced cells. The smaller 
molecular mass of ICAM-1 expressed in COS cells is 
15 consistent with, the lower molecular masses observed for 
other surface antigens expressed in COS cells. 

RNA blot analysis showed 2 species of 3.2 kb and 1.9 
kb present in HL60 cells stimulated with either PMA, 
gamma IFN, TNF or IL-1 gamma, but absent in uninduced 
2 0 cells. Thus, the expression of ICAM-1 is regulated by a 
number of cytokines, apparently at the level of 
transcription. Similar species were present in B cells 
(JY and Raji), T cells (Peer and T blasts) and Lymphokine 
Activated Killer cells (LAK) . The structure of these 
25 ICAM-1 transcripts and their relationship to the pICAM-1 
cDNA remains to be established. Blot hybridization of 
genomic DNA from placenta revealed a pattern consistent 
with a single copy gene. 

To investigate whether pICAM-1 encodes a functional 
30 cell adhesion molecule, COS cells expressing ICAM-1 were 
tested for their ability to bind HL60 cells. After 30 
minutes at 37 °C in the presence of Mg 2+ , HL60 cells 
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strongly adhered to the ICAM-expressing COS cells, but 
not to mock transfected cells. The specificity of this 
adhesion was demonstrated by preincubating the ICAM-l 
expressing COS cells with mAb 84H10. All HL60 binding 

05 was abolished under these conditions. An isotype matched 
monoclonal antibody, W6/32, which recognizes a 
monomorphic HLA-ABC related determinant of approximately 
equal abundance to ICAM-l on transfected COS cells, had 
no effect on the adhesion. Similarly, preincubation of 

10 the HL60 cells with either 84H10 or W6/32 did not inhibit 
binding. 

To determine if LFA-1 was acting as the receptor for 
ICAM-l in this system, HL60 cells were pretreated with 
antibodies against the beta chain of LFA-l (CD18 

15 (McMichael, A.J. , et al. , eds. , Leukocyte Typing III. 
White Cell Diffe rentiation Antigens . Oxford University 
Press (1987))) and then subjected to the binding assay. 
All adhesion to ICAM-expressing COS cells was blocked. 
Pretreatment of COS cells with the CD18 antibodies had no 

20 effect on the adhesion. This provides direct evidence 
that ICAM-l is indeed acting as an adhesion ligand for 
LFA-l. 

The sequence of the pICAM-1 cDNA insert consists of 
1846 nucleotides (Fig. 11) . The predicted peptide 

25 sequence of 532 residues has the typical features of a 
transmembrane protein including a putative signal 
sequence, which may be cleaved between glycine-25 and 
asparagine-26 (von Heijne, G. , Nucl. Acids Res. 
14:4683-4690 (1986)), and a single 25 residue 

30 membrane-spanning domain terminating in a short, highly 
charged cytoplasmic domain. The extracellular domain 
contains seven potential N-linked glycosylation sites 
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which could adequately explain the difference in size 
between the deglycosylated precursor (55 JcD) and the 
final product (90-115 kD) (Dustin, M.L., et aL , J. 
Immunol. 137 :245-254 (1986)). Differential use of these 
05 putative glycosylation sites could also explain the 
heterogeneous molecular mass of ICAm-1 observed in 
different cell types (Dustin, M.L., et al. f J. Immunol. 
137 :245-254 (1986) ) . 

LFA-1 is a member of the integrin family of cell 
10 surface receptors (Kishimoto, T.K. , et al . , Cell 

£8: 681-690 (1987) ; Hynes , R.O., Cell 48 :549-554 (1987)). 
The tripeptide motif Arg-Gly-Asp (RGD) is a common 
feature of the ligands for this family, e.g., 
fibronectin, fibrinogen, vitronectin and von Willebrand 
15 factor, and is crucial for ligand-receptor interaction 

(Ruoslahti, E, et al. , Cell 44 :517-518 (1987)). However, 
ICAM-1 contains no RGD motifs, bearing instead a single 
RGE sequence at position 152. A search of the National 
Biomedical Research Foundation (Dayhoff, M.O., et al . , 
20 Methods Enzymol. 91:524-545 (1983)) (NBRF) database 

revealed no significant similarities to other proteins. 
However, a comparison to a laboratory database containing 
recently published surface proteins did reveal a 
surprising and significant similarity between ICAM-1 and 
25 the neural cell adhesion molecule NCAM-1 (Cunningham, 

B.A., et al. , Scienc 236 :799-806 (1987); Barthels, D. , et 
al^, EMBO *J. 6:907-914 (1987)). The optimal alignment ™~ 
score obtained using the NBRF ALIGN program is 3 standard 
deviations above the mean score obtained from 500 random 
3 0 permutations of the sequences. The probability of the 
spontaneous occurrence of an equal or higher score is 
approximately lo" 9 . 
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Using a database of known immunoglobulin related 
sequences, it has been shown that ICAM-1 may be divided 
into five Ig domains (28-112, 115-206, 217-310, 312-391, 
and 399-477) each of which shows significant similarity 
with other members of the Ig superfamily (Williams, A.F. , 
Immunol. Today 8:298-3-3 (1987)). For example, domain I 
is similar to CD3 whilst domains IV and V are similar to 
domains of myelin associated glycoprotein (Arguint, m. , 
et al. , Proc. Natl. Acad, Sci. USA 84:600-604 (1987)) and 
10 carcinoembryonic antigen (Beauchemin, N. , et al. , Mol» 

Cell. Biol. 7:3221-3230 (198.7)). All five Ig domains of 
NCAM align with the Ig segments in I CAM, and the 
principal contribution to the similarity comes from 
domains II and III of ICAM. Finally, the T cell-specific 
15 adhesion molecule CD2 shows roughly the same similarity 
to NCAM as does ICAM, but ICAM and CD2 are only weakly 
related o Thus, some precursor of NCAM is ancestral to 
both ICAM and CD2. 

The availability of a functional ICAM-1 cDNA will 
20 allow a better assessment of the role of ICAM-l/LFA-1 

mediated adhesion in antigen-specific leukocyte function, 
including T-cell mediated killing, T-helper responses and 
antibody -dependent cell mediated killing, 

EXAMPLE VXII Isolation and Molecular Cloning of the 
25 Human CD19, CD20, CD22, CDw32a, CDw32b, 

and CD40 Antigens 

The rapid immunoselection cloning method of the 
present invention was applied to isolate and clone the 
CD19, CD20, CD22, CDw32a, CDw32b, and CD40 antigens. The 
30 nucleotide sequence of CD19is shown in Figure 12. The 
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nucleotide sequence of CD20 is shown in Figure 13. The 
nucleotide sequence of CD22 is shown in Figure 14. The 
nucleotide sequence of CDw32a is shown in Figure 15. The 
nucleotide sequence of CDw32b is shown in Figure 16. The 
05 nucleotide sequence of CD40 is shown in Figure 17. 

Equivalents 

Those skilled in the art will recognize or be able 
to ascertain, using no more than routine experimentation, 
many equivalents to the specific embodiments of the 
L0 invention described herein. Such equivalents are 

intended to be encompassed within the scope of this 
invention. 



WO 89/08114 



98 



PCT/US88/02809 



CLAIMS 

l. A method for cloning a cDNA encoding a cell surface 
antigen , comprising i 

(a) preparing a cDNA library; 

(b) introducing said cDNA library into mammalian 
tissue culture cells; 

(c) culturing said cells under conditions allowing 
expression of said cell surface antigen by said 
cells; 

(d) exposing said cells to a first antibody directed 
against said cell surface antigen selected from 
the group consisting of CDla, CDlb, CDlc, CD6 f 
CD13, CD14, CD16, CD19 , CD20, CD22, CD26, CD27 , 
CD31, CDw32a, CDw32b, CD33, CD34, CD36, CD37 , 
CD38, CD3 9, CD4 0 f CD43, I CAM, FcRIa, FcRIb, 
TLiSa, and Lev- 8 ; 

(e) thereafter exposing said cells to a substrate 
coated with a second antibody directed against 
said first antibody, thereby causing said cells 
to adhere to said substrate; and 

(f) separating adherent from non-adherent cells. 

2- The method of claim 1 wherein said cDNA library is 
prepared from RNA extracted from a human cell line, 

3. The method of claim 2, wherein said cell line is 
selected from the group consisting of HPB-ALL and JY- 

4. The method of claim 1, wherein said cDNA library is 
prepared from RNA extracted from human tumor cells. 

5 . The method of claim 4 , wherein said human tumor cell 
is a human adrenal tumor cell. 
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6. A cloned cDNA segment encoding a cell surface antigen 
selected from the group consisting of CDla, CDlb, 
CDlC, CD6, CD13, CD14 , CD16, CD19 , CD20, CD22, CD26, 
CD27, CD31, CDv32a, CDw32b, CD33, CD34, CD36, CD37, 
CD3 8, CD3 9, CD4 0 , CD4 3, I CAM, FcRIa , FcRIb, TLiSa, and 
Lev-8 , or a functional derivative thereof. 

7. A substantially pure protein selected from the group 
consisting of CDla, CDlb, CDlc, CD6 , CD13, CD14, CD16, 
CD19, CD20, CD22, CD26, CD27, CD31, CDw32a, CDw32b, 
CD33, CD34, CD3 6 , CD37, CD38, CD39, CD40, CD43, ICAM, 
FcRIa, FcRIb, TLiSa, and Lev-8, or a functional 
derivative thereof . 

8. Substantially pure CDw32a antigen, or a functional 
derivative thereof . 

9. Substantially pure CDw32b antigen, or a functional 
derivative thereof „ 

10. Substantially pure CD2 0 antigen, or a functional 
derivative thereof . 

11. Substantially pure CD19 antigen, or a functional 
derivative thereof • 

12. Substantially pure CD22 antigen, or a functional 
derivative thereof. 

13. Substantially pure CD22 antigen, or a functional 
derivative thereof . 



14. Substantially pure CD4 0 antigen, or a 
derivative thereof . 



functional 
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15. Substantially pure cDNA comprising the nucleotide 
sequence shown in Figure 15, or a functional 
derivative thereof „ 

16 • Substantially pure cDNA comprising the nucleotide 
sequence shown in Figure 16, or a functional 
derivative thereof- 

17 o Substantially pure cDNA comprising the nucleotide 

sequence shown in Figure 13, or a functional 
derivative thereof • 

18 c Substantially pure cDNA comprising the nucleotide 

sequence shown in Figure 12, or a functional 
derivative thereof » 

19. Substantially pure cDNA comprising the nucleotide 
sequence shown in Figure 14, or a functional 
derivative thereof <> 

20 o Substantially pure cDNA comprising the nucleotide 

sequence shown in Figure 11, or a functional 
derivative thereof., 

21 o Substantially pure cDNA comprising the nucleotide 

sequence shown in Figure 17, or a functional 
derivative thereof, 

22 o Substantially pure protein having the amino acid 
sequence shown in Figure 10, or a functional 
derivative thereof * 

23 • Substantially pure protein having the amino acid 
sequence shown in Figure 11, or a functional 
derivative thereof . 
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24. A pharmaceutical composition comprising substantially 
pure cell surface antigen, said cell surface antigen 
present in soluble form or in the form of an 
acceptable salt . 

25. A pharmaceutical composition of claim 57 additionally 
comprising an acceptable carrier. 

26. A pharmaceutical composition of claim 25, wherein the 
cell surface antigen is selected from the group 
consisting of CDla, CDlb, CDlc, CDS , CD13 , CD14 , CD16, 
CD19, CD20, CD22, CD26, CD27, CD31, CDw32a, CDw32b, 
CD33, CD34, CD36, CD37, CD38, CD39, CD40, CD43, I CAM, 
FcRIa, FcRIb, TLiSa, and Lev-8, or a functional 
derivative of said cell surface antigen. 

27. A pharmaceutical composition of claim 26, additionally 
comprising a therapeutic agent. 

28. An immunotherapeutic agent, comprising substantially 
pure cell surface protein, an acceptable carrier and a 
therapeutic agent. 

29. A kit useful for immunodiagnostic assay, comprising 
substantially pure cell surface antigen in soluble 
form and a containing means • 
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1 GGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 

51 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GAACTGGCTT 

101 CAGCAGAGCG CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG 

151 GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA 

201 ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG 

251 GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA 

301 CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGM 

351 CTGAGATACC TACAGCGTGA GCTATGAGAA AGCGCCACGC TTCCCGAAGG 

401 GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC 

451 GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC 

501 GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 

551 GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCCGAATTa ccgcggtCTT 

601 TCTCAACGTA ACACTTTACA GCGGCGCGTC ATTTGATATG ATGCGCCCCG 

651 CTTCCCGATA AGGGAGCAGG CCAGTAAAAG CATTACCCGT GGTGGGGTTC 

701 CCGAGCGGCC AAAGGGAGCA GACTCTAAAT CTGCCGTCAT CGACTTCGM 

751 GGTTCGAATC CTTCCCCCAC CACCATCACT TTCAAAAGTC CGAAAGAAtC 

801 TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGTAAAATT 

851 TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT 

901 TAGGGTTAGG CGTTTTGCQC TGCTTCGCGA TGTACGGGCC AGATATACGC 

951 GTTGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT TACGGGGTCA 

1001 TTAG1TCATA GCCCATATAT GGAGTTCCGC GTTACATAAC TTACGGTAAA 

1051 TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA 

1101 TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 

1151 TGGGTGGACT AT7TACGGTA MCTGCCCAC TTGGCAGTAC ATCAAGTGTA 

1201 TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT AMTGGCCCG 

• 1251 CCTGGCA7TA TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG 

1301 TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA 

1351 GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC 

1401 TCCACCCCAT TGACGTCAAT GGGAG7TTGT TTTGGCACCA AAATCAACGG 

1451 GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG 

1501 AATTCCTGGG CGGGACTGGG GAGTGGCGAG CCCTCAGATG CTGCATATAA 

1551 GCAGCTGCTT TTTGCCTGTA CTGGGTCTCT CTGGTTAGAC CAGATCTGAG 

1601 CCTGGGAGCT CTCTGGCTAA CTAGAGAACC CACTGCTTAA GCCTCAATAA 

1651 AGCTTCTAGA GATCCCTCGA CCTCGAGGGA TCTTCCATAC CTACCAGTTC 
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1701 TGCGCCTGCA GGTCGCGGCC GCGACTCTAG AGGATCTTTG TGAAGGAACC 
1751 TTACTTCTGT GGTGTGACAT AATTGGACAA ACTACCTACA GAGATTTAAA 
1801 GCTCTAAGGT AAATATAAAA TTTTTAAGTG TATMTGTGT TAAACTACTG 
1851 ATTCTAAnG TTTGTGTATT TTAGATTCCA ACCTATGGAA CTGATGAATG 
1901 GGAGCAGTGG TGGAATGCCT TTAATGAGGA AAACCTGTTT TGCTCAGAAG 
1951 AAATGCCATC TAGTGATGAT GAGGCTACTG CTGACTCTCA ACATTCTACT 
2001 CCTCCAAAAA AGAAGAGAAA GGTAGAAGAC CCCAAGGACT TTCCTTCAGA 
2051 ATTGCTAAGT TTTTTGAGTC ATGCTGTGTT TAGTAATAGA ACTCTTGCTT 
2101 GCTTTGCTAT TTACACCACA AAGGAAAAAG CTGCACTGCT ATACAAGAAA 
2151 ATTATGGAAA AATATTCTGT AACCTTTATA AGTAGGCATA ACAGTTATAA 
2201 TCATAACATA CTGI II 1 1 IC TTACTCCACA CAGGCATAGA GTGTCTGCTA 
2251 TTAATAACTA TGCTCAAAAA TTGTGTACCT TTAGCTTTTT AATTTGTAAA 
2301 GGGGTTAATA AGGAATATTT GATGTATAGT GCCTTGACTA GAGATCATAA 
2351 TCAGCCATAC CACATTTGTA GAGGTTTTAC TTGCTTTAAA AAACCTCCCA 
2401 CACCTCCCCC TGAACCTGAA ACATAAAATG AATGCAATTG TTGTTGTTAA 
2451 CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA 
2501 ATTTCACAAA TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC 
2551 AAACTCATCA ATGTATCTTA TCATGTCTGG ATCCTGTGGA ATGTGTGTCA 
2601 GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA AGTATGCAAA 
2651 GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC 
2701 CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT 
2751 AGTCCCGCCC CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG 
2801 CCCATTCTCC GCCCCATGGC TGACTAATTT T7TTTATTTA TGCAGAGGCC 
2851 GAGGCCGCCT CGGCCTCTGA GCTATTCCAG AAGTAGTGAG GAGGCTTT7T 
2901 TGGAGGCCTA GGCTTTTGCA AAAAGCTaat TC 
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CCTAAGATGAGCTTTCCATGTAAATTTGTAGCCAGCTTCCTTCTGAI 1 1 ICAATGTTTCT 
MetSerPheProCysLysPheValAlaSerPheLeuLeuIlePheAsnValSer 


(60) 




TCCAMGGTGCAGTCTCCAAAGAGAmCGMTGCCTTGGAMCCTGGGGTGCCnGGGt 
SerLysGlyAlaValSerLysGluIleThrAsnAlaLelGluThrTrpGlyAlaLeuGly 


(120) 


20 


1 

CAGGACATCAACTTGGACATTCCTAGI 1 1 ICAAATGAGTGATGATATTGACGATATAAAA 
GlnAspIleAsnLeuAspIleProSerPheGlnMetSerAspAspIleAspAspIleLys 


(180) 


40 


TGGGAAAAMCrrCAGACAAGAAAMGATtGCACMTTCAGAAMGACJAM 
TrpGluLysThrSerAspLysLysLysIleAlaGlnPheArgLysGluLysGluThrPhe 


(240) 


60 


MGGAAAM^TACATATMGCTATTTAAAMTO 

LysGluLysAspThrTyrLysLeuPheLysAsnGlyThrLeuLysIleLysHisLelLys 

. — CHO — . 

ACCGATGj\TCAG(^TATCTACMGGTATCMTATATGATACAAAAGGAAAAAATGTGTTG 
ThrAspAspGlnAspIleTyrLysValSerIleTyrAspThrLysGlyLysAsnValLeu 


(300) 


80 


(360) 


100 


gaaaamtatttgatitgmgancmga(^gggtctcaamccaaagatctcctggact 
GluLysIlePheAspLeuLysIleGlnGluArgValSerLysProLysIleSerTrpThr 


(420) 


120 


TGTATCMCACMCCCTGACCTGTGAGGTMTGMTGGAACTGACCCCGMTTAAACCTG 
CysIleAsnThrThrLeuThrCysGluValMetAsnGlyThrAspProGluLeuAsnLeu 

— CHO — . . — CHO — 
TATCMGATGGGAMCATCTAAAACTTTCTCAGAGGGTCATCACACACAAGTGGACCACC 
TyrGlnAspGlyLysHisLbjLysLeuSerGlnArgValIleThrHisLysTrpThrThr 


(480) 


140 


(540) 


160 


agcctgagtgcaamttcmgtgcacagcagggmcamgtcagcmgg^atccagtgtc 
SerLeuSerAlaLysPheLysCysThrAlaGlyAsnLysValSerLysGluSerSerVal 


(600) 


180 


GAC^CTGTCAGCTGTCCAGAGAMGGTCTGGACATCTATCTCATCATTGGCATATGTGGA 
GluProValSerCysProGluLysGlyLeuAspIleTyrLeuIleIleGlyIleCysGly 


(660) 


200 


ggaggcagcctcttgatggtctttgtggcactgctcgttttctatatcaccaaaaggaaa 
GlyGlySerLeuLbJ^ValPheValAlaLeuLeuValPheTyrIleThrLysArgLys 
TM . 

AAALA(jAvjVjA(j 1 LWjAvjAAA 1 uA 1 uAbbAuL 1 bbAuAL/VAw\UL^V.ACAoMVj 1 MoL 1 rtv- 1 

LysGuvjaegSerArgArgAsnAspGluGluLeuGluThrArgAlaHisArgValAlaThr 


(720) 


220 


\ 1 OKJJ 


240 


GMGAMGGGGCCGGMGCCCCMCAMTtCCAGCTTCAACCCCTCAGAATCCAGCAACt 
GluGluArgGlyArgLysProGlnGlnIlePrqAlaSerThrProGlnAsnProAlaThr 


(840) 


260 


TCCCMCATCCTCCTCCACCACCTGGTCAtCGTTCCCAGGCACCTAGTCATCGTCCCCCG 
SerGlnHisProProProProProGlyHisArgSerGlnAlaProSerHisArgProPro 


(900) 


280 


CCTCCTGGACACCGTGTTCAGCACCAGCCtCAGAAGAGGCCTCCTGCTCCGTCGGGCACA 
ProProGlyHisArgValGlnHisGlnProGlnLysArgProProAlaProSerGlyThr 


(960) 
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CMGnCACCAGCAGAMGGCCCGCCCCTCCCCAGACCTCGAGTTCAGCCAAAACCTCCC (1020) 
300 GlnValHisGlnGlnLysGlyProProLeuProArgProArgValGlnProLysProPro 

CATGGGGCAGCAGAAMCTCATTGTCCCCtTCCTCTAAnAAAAAAGATAGAAACTGTCt (1080) 
320 HisGlyAlaAlaGluAsnSerLeuSerProSerSerAsnEnd 

TTTrCMTAAAMGCACTGtGGATnCTGCCCTCCTGATGTGCATATCCGTACTTCCATG (1 140) 

AGGTGTTTTCTGTGTGCAGAACATTGTCACCTCCTGAGGCTGTGGGCCACAGCCACCTCt ( 1 200) 

GCATCnCGAACTCAGCCAtGTGGTCMCATCTGGAGTTiTTGGTCTCCtCAGAGAGCTC (1260) 

CATCACACCAGTMGGAGMGCMTATMG^GTGAnGCMGAATGGTAGAGGACCGAGC (1320) 

ACAGAMTCtTAGAGATTTCnGTCCCCTCTCAGGTCATGTGTAGATGCGATAMTCAAG (1380) 

TGAnGGTGtGCCTGGGTCTCACTACMGCAGCCTATCTGCnMClAGACTCTGOAGTTt (1440) 

CTTATGTGCCCTGGTGGACACnGCCCACCATCCTGTGAGTAAMGTGAAATAAMGCTT (1500) 

TGAC (1504) 
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1 GGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 

51 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT 

101 TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA 

151 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 

201 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 

251 GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 

301 ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA 

351 ACTGAGATAC CTACAGCGTG AGCATTGAGA AAGCGCCACG CTTCCCGAAG 

401 GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 

451 CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 

501 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 

551 GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCAAGCTA GCTTCTAGCT 

601 AGAAATTGTA AACGTTAATA TTTTGTTAAA ATTCGCGTTA AATTTTTGTT 

651 AAATCAGCTC ATTTTTTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT 

701 AAATCAAAAG AATAGCCCGA GATAGGGTTG AGTGTTGTTC CAGTTTGGAA 

751 CAAGAGTCCA CTATTAAAGA ACGTGGACTC CAACGTCAAA GGGCGAAAAA 

801 CCGTCTATCA GGGCGATGGC CGCCCACTAC GTGAACCATC ACCCAAATCA 

851 AGI 1 1 1 1 IGG GGTCGAGGTG CCGTAAAGCA CTAAATCGGA ACCCTAAAGG 

901 GAGCCCCCGA TiTAGAGCTT GACGGGGAAA GCCGGCGAAC GTGGCGAGAA 

951 AGGAAGGGAA GAAAGCGAAA GGAGCGGGCG CTAGGGCGCT GGCAAGTGTA 

1001 GCGGTCACGC TGCGCGTAAC CACCACACCC GCCGCGCTTA ATGCGCCGCT 

1051 ACAGGGCGCG TACTATGGTT GCTTTGACGA GCACGTATM CGTGCTTTCC 
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1101 TCGTTGGAAT CAGAGCGGGA GCTAAACAGG AGGCCGATTA AAGGGATTTT 

1151 AGACAGGAAC GGTACGCCAG CTGGATCACC GCGGTCTTTC TCAACGTAAC 

1201 ACTTTACAGC GGCGCGTCAT TTGATATGAT GCGCCCCGCT TCCCGATAAG 

1251 GGAGCAGGCC AGTAAAAGCA TTACCCGTGG TGGGGTTCCC GAGCGGCCAA 

1301 AGGGAGCAGA CTCTAAATCT GCCGTCATCG ACTTCGAAGG TTCGAATCCT 

1351 TCCCCCACCA CCATCACTTT CAAAAGTCCG AAAGMtCTG CTCCCTGCTT 

1401 GTGTGTTGGA GGTCGCTGAG TAGTGCGCGA GTAAMTTTA AGCTACAACA 

1451 AGGCAAGGCT TGACCGACAA TTGCATGAAG AATCTGCTTA GGGTTAGGCG 

1501 TTTTGCGCTG CTTCGCGATG TACGGGCCAG ATATACGCGT TGACATTGAT 

1551 TATTGACTAG TTATTAATAG TAATCAATTA CGGGGTCATT AGTJCATAGC 

1601 CCATATATGG AGJTCCGCGT TACATAACTT ACGGTAAATG GCCCGCCTGG 

1651 CTGACCGCCC AACGACCCCC GCCCATTGAC GTCAATAATG ACGTATGTTC 

1701 CCATAGTAAC GCCAATAGGG ACTTTCCATT GACGTCAATG GGTGGACTAT 

1751 TTACGGTAAA CTGCCCACTT GGCAGTACAT CAAGTGTATC ATATGCCAAG 

1801 TACGCCCCCT ATTGACGTCA ATGACGGTAA ATGGCCCGCC TGGCATTATG 

1851 CCCAGTACAT GACCTTATGG GACTTTCCTA- CTTGGCAGTA CATCTACGTA 

1901 TTAGTCATCG CTATTACCAT GGTGATGCGG TTTTGGCAGT ACATCAATGG 

1951 GCGTGGATAG CGG1TTGACT CACGGGGATT TCCAAGTCTC CACCCCATTG 

2001 ACGTCAATGG GAGTTTGTTT TGGCACCAAA ATCAACGGGA CTTTCCAAAA 

2051 TGTCGTAACA ACTCCGCCCC ATTGACGCAA ATGGGCGGAA TTCCTGGGCG 

2101 GGACTGGGGA GTGGCGAGCC CTCAGATGCT GCATATAAGC AGCTGCTTTT 

2151 TGCCTGTACT GGGTCTCTCT GGTTAGACCA GATCTGAGCC TGGGAGCTCT 

2201 CTGGCTAACT AGAGAACCCA CTGCTTAAGC CTCAATAAAG CTTCTAGAGA 

2251 TCCCTCGACC TCGAGATCCA TTGTGCTGGC GCGGATTCTT TATCACTGAT 
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2301 MGTTGGTGG ACATATTATG TTTATCAGTG ATAAAGTGTC AAGCATGACA 

2351 AAGTTGCAGC CGAATACAGT GATCCGTGCC GCCCTAGACC TGTTGAACGA 

2401 GGTCGGCGTA GACGGTCTGA CGACACGCAA ACTGGCGGAA CGGTTGGGGG 

2451 TTCAGCAGCC GGCGCTTTAC TGGCACTTCA GGAACAAGCG GGCGCTGCTC 

2501 GACGCACTGG CCGAAGCCAT GCTGGCGGAG AATCATAGCA CTTCGGTGCC 

2551 GAGAGCCGAC GACGACTGGC GCTCATTTCT GACTGGGAAT GCCCGCAGCT 

2601 TCAGGCAGGC GCTGCTCGCC TACCGCCAGC ACAATGGATC TCGAGGGATC 

2651 TTCCATACCT ACCAGTTCTG CGCCTGCAGG TCGCGGCCGC GACTCTAGAG 

2701 GATCTTTGTG AAGGAACC7T ACTTCTGTGG TGTGACATAA TTGGACAAAC 

2751 TACCTACAGA GATTTAAAGC TCTAAGGTAA ATATAAAATT TTTAAGTGTA 

2801 TAATGTGTTA AACTACTGAT TCTAATTGTT TGTGTATTTT AGATTCCAAC 

2851 CTATGGAACT GATGAATGGG AGCAGTGGTG GAATGCCTTT AATGAGGAAA 

2901 ACCTGTTTTG CTCAGAAGAA ATGCCATCTA GTGATGATGA GGCTACTGCT 

2951 GACTCTCAAC ATTCTACTCC TCCAAAAAAG AAGAGAAAGG TAGAAGACCC 

3001 CAAGGACTTT CCTTCAGAAT TGCTAAGTTT TTTGAGTCAT GCTGTGTTTA 

3051 GTAATAGAAC TCTTGCTTGC TTTGCTATTT ACACCACAM GGAAAAAGCT 

3101 GCACTGCTAT ACAAGAAAAT TATGGAAAAA TATTCTGTAA CCTTTATAAG 

3151 TAGGCATAAC AGTTATAATC _ ATAACATACT GTTTTTTCTT ACTCCACACA 

3201 GGCATAGAGT GTCTGCTATT AATAACTATG CTCAAAAATT GTGTACCTTT 

3251 AGCTTTTTAA TFTGTAAAGG GGTTAATAAG GAATATTTGA TGTATAGTGC 

3301 CTTGACTAGA GATCATAATC AGCCATACCA CATTTGTAGA GGTTTTACTT 

3351 GCTTTAAAAA ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA 

3401 TGCAATTGTT GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT 

3451 AAAGCAATAG CATCACAMT TTCACAAATA AAGCATTTTT TTCACTGCAT 
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ACTCATCAAT GTATCTTATC ATGTCTGGAT 
TAGGGTGTGG AAAGTCCCCA GGCTCCCCAG 
ATGCATCTCA ATTAGTCAGC AACCAGGTGT 
AGCAGGCAGA AGTATGCAAA GCATGCATCT 
TCCCGCCCCT AACTCCGCCC ATCCCGCCCC 
CATTCTCCGC CCCATGGCTG ACTAATTTTr 
GGCCGCCTCG GCCTCTGAGC TATTCCAGAA 
GAGGCCTAGG CTTTTGCAAA AAGCTAATTC 
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AGACTCTCAGGCCTTGGCAGGTGCGTCTTtCAGTTCCCCtCACACnCGGGTTCCTCGGG (50) 

GAGGAGGGGCTGGAACCCTAGCCCATCGTCAGGACAAAGATGCTCAGGCtGCTCTTGGCT (120) 

MetLeuArgLelLeuLeuAla 
-18 

CTCMCmTTCCCTTCMnCAAGTAACAGGAAACAAGATTTTGGTGAAGCAGTCGCCC ( 180) 
LeuAsnLhjPheProSerIleGlnValThrGlyAsnLysIleLeuValLysGlnSerPro 

. +1 

ATGCTTGTAGCGTACGACAATGCGGTCAACCTTAGCTGCAAGTATTCCTACAATCTCTTC (240) 

1 0 METLEUValAlaTyRASPAS^LAVaL ASNLEUSERCYSLYSTYRSERTYRASNl_EUPl-e 

— CHO — 

TCAAGGGAGTTCCGGGCATCCCTTCACAAAGGACTGGATAGTGCTGTGGAAGTCTGTGTT (300) 
30 SerArgGluPheArgAlaSerLeuHisLysGlyLeuAspSerAlaValGluValCysVal 

GTATATGGOAAnACTCCCAGCAGCnCAGGmACTCAAAAACGGGGTtCAACTGTGAt (360) 
50 ValTyrGlyasnTyrSerGlnGlnLeuGlnValTyrSerLysThrGlyPheAsnCysAsp 
----CHO — ..... 
GGGAMTTGGGCAATGAATCAGTGACATTCTACCTCCAGAATTTGTATGTTAACCAAACA (420) 
70 GlyLysLeuGlyAsnGluSerValThrPheTyrLeuGlnAsnLeuTyrValAsnGlnThr 
, — CHO — . . . — CHO — 

GATATTTACnCTGCAAMTTGMGTTATGTATCCTCCTCCTTACCTAGACAATGAGAAG (480) 
90 AspIleTyrPheCysLysIleGluValMetTyrProProProTyrLeuAspAsnGluLys 

AGCMTGGMCCATTATCCATGTC^W\GGGAMCACC7TTGTCCMGTCCCCTATTTCCC (540) 

110 SerAsnGlyThrIleIleHisValLysGlyLysHisLelCysProSerProLeuPhePro. 

— CHO — ..... 

GGACCTTCTMGCCCTTTTGGGTGCTGGTGGTGGTTGGTGGAGTCCTGGCTTGCTATAGC (600) 

130 GlyProSerLysProPheTrpValLeuValValValGlyGlyValLeuAlaCysTyrSer 



1TGCTAGTMCAGTGGCCT1TATTATTTTCTGGGTGAGGAGTAAGAGGAGCAGGCTCCTG (660) 
150 L£uI^uValThrValAj\Pf£Ii^Ii^PheTrpVai^gSerLysAjrgSerArgLeuLeu 



CACAGTGACTACATGAACATGACTCCCCGCCGCCCCGGGCCCACCCGCAAGCATTACCAG (720) 
170 HisSerAspTyrMetAsnMetThrProArgArgProGlyProThrArgLysHisTyrGln 

CCCTAT(X:CCCACCACGCC^CTTCGCAGCCTATCGCTCCtGACACGGACGCCTATCCAGA (780) 
190 ProTyrAlaProProArgAspPi-eAlaAlaTyrArgSer&vd 

202. 

AGCCAGCCGGCTGGCAGCCCCCATCTGCTCAATATCACTGCTCTGGATAGGAAATGACCG (840) 

CCATCTCCA(XCGGCCACCTCAGCCCCTGtTGGGCCACCMTGCCMTllTTCTCGAGTG (900) 

ACTAGACCAAATATCM(^TCATmGAGACTCTG^ (960) 

GACAGGCCAAGTCnACAGtGCCATGGCCCACATTCCMCTTACCATGTACTTAGTGACt (1020) 

TGACT(^GAAGnAGGGTAGAAMCAAAAAGGC^GTG(^trCTGGGAGCCTCTTCCCTTt (1080) 
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CTCACTCACCTGCACATCTCAGTCMGCAAAGTGTGGTAtCCACAGACAiTTTAGTTGCA (1140) 

GMGAMGGCTA(X^MTCAnCCTTTTGGn^ (1200) 

GGnAMCGGGGTMGnAGAGTAGGGGGAGGGATAGGAAGACATAmAAAAACCATTA (1260) 

AMCACTGTCTCCCACTCAtGAMTGAGCCACGTAGnCCTAmMTGCTGTmCCTt (1320) 

TAGmAGAAATACATAGACAnGTCTTTrATGMTTCTGATCATAmAGTCATm (1380) 

CCAMTGAGGGATTTGGTC.4MTGAGG(^lTCCCTCAMGCAATATCAGGTAAACCAAGf (1440) 

TGCmcaCACTCCCTGTCATGAGACTTCAGTGTTMTGnCACMTATACmC^ (1500) 
GAATAAAATAGTTC (1514) 

FIG. 7. (contj 
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TAGACCCAGAGAGGCTCAGCTGCACTCGCCCGGCTGGGAGAGCTGGGTGTGGGGMCATG 

Met 

GCCGGGCCTCCGAGGCTCCTGCTGCTGCCCCTGCTTCTGGCGCTGGCTCGCGGCCTGCCt 
AlaGlyProProArgLeuLeuLeuLeuProLeuLeuLeuAlaLeuAlaArgGlyLeuPro 

GGGGCCCTGGCtGCCCMGGTMGAOITTCCCAGGCTCTCCATGGCCACAGCTCCGGAGC 
GlyAlaLeuAlaAlaGln / 

TCTCCCTGCCCCATGAGCTCAGAGCCCCCAGTCTGAGCCACAGCACAGCCCCCAGGAAGC 

GGGTGGGGTGCTGAGCGGCCTCCAGTGTCTGAGC^CTCAtTTMGAC^GGAAAAAGGGt 

GGACCCGGTGGGGAGTGGCCGGGGCTGTCCAGGCAGGGCCGCTGCTTTGGGAGGAAGAAG 

CCCACAGTCtCGGAACACGAGGACAGCACCTCCCCCAACACCACAGCCGGTGCCCAGATC 

TGCTCCATGCCCCGTMGGCACCGTGTCTTTGGCGACATGTCAGCCCTGGGCTGTCTCAG 

GGCCCCACCATCCCCACCACTGTCCCCTGCAGGGAGGACATTCTCTGTCCTTCTGGCCAG 

/ 

ACTGATGGTGACAGCCCAGGTCCTCCCAGAGGTGCAGCAGTCTCCCCACTGCACGACTGT 

GluValGlnGlnSerProHisCysThrThrVa 

CCCCGTGGGAGCCTCCGTCMCATCACCTGCTCCACCAGCGGGGGCCTGCGTGGGATCTA 
lProValGlyAlaSerValAsnIleThrCysSerThrSerGlyGlyLeuArgGlyIleTy 



CCTGAGGCAGCTCGGGCCACAGCCCCMGACATCATTTACTACGAGGACGGGGTGGTGCC 
rLhjArgGlnLelGlyProGlnProGlnAspIleIleTyrTyrGluAspGlyValValPr 

CACTACGGACAGACGGTTCCGGGGCCOtCGACTTCTCAGGGTCCCAGGACMCCTGAC 
oThrThrAspArgArgPheArgGlyArgIleAspPheSerGlySerGlnAspasnLeuTh 



TATCACCATGCACCGCCTGCAGCTGTCGGACACTGGGACCTACACCTGCCAGGCCATCAC 
rIleThrMetHisArgLelGlnLbjSerAspThrGlyThrTyrThrCysGlnAlaIleTh 

GGAGGTCMtGTCTACGGCtCCGGCACCCtGGTCCTGGTGACAGAGGAACAGTCCCAAGG 
rGluValAsnValTyrGlySerGlyThrLhjValL^ValThrGluGllGlnSerGlnGl 

AT^CACAGATGCTCGGACGCCCCACCMGGGCCTCTGCCCTCCCTGCCCCACCGACAGG 
yTrpHisArgCysSerAspAlaProProArgAlaSerAlaLeuProAlaProProThrGl 

CTCCGCCCTCCCTGACCCGCAGACAGCCTCTGCCCTCCCtGACCCGCCAGCAGCCTCTGC 
ySerAlaLeuProAspProGlnThrAlaSerAlaLeuProAspProProAlaAlaSerAl 

CCTCCCTGCGGCCCTGGCGGTGATCTCCTtCCTCCTCGGGCTGGGCCTGGGGGTGGCGTG 
aLeuProAlaAlaLeuAlaValIleSerPheLeuLeuGlyLeuGlyLeuGlyValAlaCy 



(60) 

(120) 

(180) 

(240) 
(300) 
(360) 
(420) 
(480) 
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(600) 
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TGTGCTGGCGAGGACACAGATAAAGAAACTGTGCTCGTGGCGGGATAAGAATTCGGCGGC ( 1 1 40) 
sValLeuAlaArgThrGlnIleLysLysLeuCysSerTrpArgAspLysAsnSerAlaAl 



ATGTGTGGTGTACGAGGACATGTCGCACAGCCGCTGCAACACGCTGTCCTCCCCCAACCA (1200) 
aCysValValTyrGluAspMetSerHisSerArgCysAsnThrLeuSerSerProAsnGl 

GTACCAGTGACCCAGTGGGCCCCTGCACGtCCCGCCTGTGGTCCCCCCAGCACCTTCCCt (1260) 
nTyrGlnEnd 

GCCCCACCATGCCCCCCACCCTGCCACACCCCTCACCCTGCTGTCCTCCCACGGCTGCA^ (1320) 

CAC^GTTrGAAGGGCCCAGCCGTGCCCAGCTCCAAGCAGACACACAGGCAGTGGCCAGGC (1380) 

CCCACGGTGCnCTCAGTGGACMTGATGCCTCCTCCGG(^GCCnCCCTGCCCAGCCC (1440) 

ACGCCGCCACCGC^GGMGCCTGACTGTCCTTTGGCTGCATCTCCCGACCATGGCCM (1500) 

(^GGGCTTTtCTGTGGGATGGGCCTGGCACGCGGCCCTCtCCTGTCAGT^CGC^CCACC (1560) 

CACCAGCAGGCCCCCAACCCCCAGGCAGCCCGGCAGAGGACGGGAGGAGACCAGTCCCCC (1520) 
ACCCAGCCGtACCAGAMTAMGGCTTCTGTGCTTCAAAAAAAAA (1665) 



FIG. 8. (cont.) 



WO 89/08114 



PCT/US88/02809 



CCCAAATGTCTCAGAATGTATGTCCCAGAAACCTGTGGCTGCTTCAACCATTGACAGTTT (50) 
MetSerGlnAsnValCysProArgAsnLeuTrpLeuLbXlnProLeuThrValL 
-29 . 

TGCTGCTGCTGGCTTCTGCAGACAGTCAAGCTGCAGCTCCCCCAAAGGCTGTGCTGAAAC (120) 
euLeuLeuLeuAlaSerAlaAspSerGlnAlaAlaAlaProProLysAlaValLeuLysL 

.-1 +1 

TTGAGCCCCCGTGGATCAACGTGCTCCAGGAGGACTCTGTGACTCTGACATGCCAGGGGG (180) 
10 elGluProProTrpIleAsnValLelGlnGluAspSerValThrLbjThrCysGlnGlyA 

* 

CTCGCAGCCCTGAGAGCGACTCCATTCAGTGGTTCCACAATGGGAATCTCATTCCCACCC (240) 
30 laArgSerProGluSerAspSerIleGlnTrpPheHisAsnGlyAsnLeuIleProThrH 

ACACGCAGCCCAGCTACAGGTTCAAGGCCAACAACMTGACAGCGGGGAGTACACGTGCC (300) 
50 isThrGlnProSerTyrArgPheLysAlaAsnAsnAsnAspSerGlyGluTyrThrCysG 

— CHO — . * . 

AGACTGGCCAGACCAGCCTCAGCGACCCTGTGCATCTGACTGTGCTTTCCGMTGGCTGG (360) 
70 lnThrGlyGlnThrSerLeuSerAspProValHisLeuThrValLeuSerGluTrpLeuV 

TGCTCCAGACCCCTCACCTGGAGTTCCAGGAGGGAGAAACCATCATGCTGAGGTGCCACA (420) 
90 alLeuGlnThrProHisLeuGluPi^Gl^luGlyGluThrIleMetLeuArgCysHisS 

* 

GCTGGAAGGACAAGCCTCTGGTCAAGGTCACATTCTTCCAGAATGGAAAATCCCAGAAAt (480) 
1 10 erTrpLysAspLysProLeuValLysValThrPhePheGlnAsnGlyLysSerGlnLysP 

TCTCCCGTTtGGATCCCACCTTCTCCATCCCACM(XAMCCACAGTCACAGTGGTGATt (540) 
130 heSerArgLeuAspProThrPheSerIleProGlnAlaAsnHisSerHisSerGlyAspT 

— CHO — . 

ACCACTGCACAGGAAACATAGGCTACACGCTGTTCTCATCCAAGCCTGTGACCATCACTG (600) 
150 yrHisCysThrGlyAsnIleGlyTyrThrLeuPheSerSerLysProValThrIleThrV 

TCCMGTGCCCAGCATGGGCAGCTCnCACCMTGGGGAtCATTGTGGCtGTGGTCATTG (660) 
170 alGu^ValProSerMetGlySerSerSerProMetGlyIleIleValAlaValValIleA 



CGAeTGCTGTAGCAGCCATTGTTGCTGCTGTAGTGGCCTTGATCTACTGCAGGAAAAAGC (-720) 
190 laThrAlaValAlaAlaIleValAlaAlaValValAlaLeuIleTyrCysArgLysLysA 
™ *- 

GGATnCAGCCAAnCCACTGATCCTGTGAAGGCTGCCCAATTTGAGCCACCTGGACGTC (780) 

210 RGIlE$ERA|^S^ERThrASPPR0ValLysAlaAi^GlnPheGLUPR0PR0GlyArgG 

AMTGATTGCCATCAGAMG^GACMCTTdAAGAMCCAACMTGACTAtGAMCAGCTG (840) 
230 lnMetIi^laIleArgLysArgGlnLelX^luGluThrAsnAsnAspTyrGluThrAlaA 

ACGGCGGCTACATGACTCTGAACCCCAGGGCACCTACTGACGATGATAAAAACATCTACC (900) 
250 spGlyGlyTyrMetThrLeuAsnProArgAlaProThrAspAspAspLysAsnIleTyrL 
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TGACTCTTCCTCCCAACGACCATGTCAACAGTAATAACTAAAGAGTAACGTTATGCCATG (960) 
270 euThrLeuProProAsnAspHisValAsnSerAsnAsnEnd 

282 . 

TGGTCATACTCTCAGCTTGCTGAGTGGATGACAAAMGAGGGGMTTGTTAAAGGAAAAT (1020) 

TTAMTGGA^CTGGAAAMTCCTGAGCAAACAAMCCACCTGGCCCnAGAMTAGC^ (1080) 

TMCmGCtTAMCTACAMCACMGCAAMCnCACGGGGTCATACTACATACAAGCA (1140) 

TMGCAAMCTTMCnGGATCATTTCTGCT^ (1200) 

CAGCCMTCACMGCAGCCTACTMCATAtMnAGGTGACTAGGGACTtTCTAAGAAGA (1260) 

TACCTACCCCCAAAAMCMmTGTMTtGAAMCCMCCGAnGC^ (1320) 

CCACATTfTCCCMTAMTACTTGCCTGT^ (1380) 

GMTTGCGCCTCAGATTTTtCCTTTAAGAtC 1 1 1 1 1 1 1 1 i 1 1 1 (LACAGAGTCTCAATCTG (1440) 

nACCCAGGCTGGAGTGCAGTGGTGCTATCTTGGCTCACtGCAMCCCGCCTCCCAG (1500) 

TMGCGATTCTCATGCCTCAGCCTCCCAGTAGCTGGGATtAGAGGCATGfGCCATCATAC (1560) 

CGAGCTMTlTnGTATTTiTTA 1 1 1 1 1 I I 1 1 1 1 IAGTAGAGACAGGGTTTCGCAATGTT (1620) 

GGCCAGGCC6\TCTC(W\CtTCTGGCCTCtAGCGATCTGCCCGCCTCG(^CTCCCAMG^ (1680) 

GCTGGGATGACCAGCATCAGCCCCMTGTCCAGCCTCTTtMCATCnciTTCCTATGCC (1740) 

CTCTCTGTG^TCCCTACTGCTGGTnCTGCCnCTCCAtGCTGAGAACAAAATCACCTA (1800) 

nCACTGCTtATGCAGTCGGMGCTCCAGAAGMGWkGAGCCCAATTACCAGAACCACA (1860) 

nMGTCTCCAnGTTTTGCCTTffi^ (1920) 

TAT1TCCTG(^CTAMTTCCCCTTGGGGM^ (1980) 

AG(^CTCTTCCAGAGTCATCTACCTGAGTCCCAMGCTGCCTGTCCTGAMGCCACAG (2040) 

MTATGGTCCCAMTGACT(^CTGCACGTtCTGTGCCTCAGCCGTTCTT(^CATCMGAA (2100) 

TCnCTGnCCACATCCACACAGCCMTACMmGTCAAACaCTGTTATTAACAGATG (2160) 

TAGCMCAT^GAMCGCTTATGTTACAGGTTACATGAGAGCAATCATGtAAGTCTATAt (2220) 

(^TCAGAMTGTTAMAtAGACTMCCtCTMCM (2280) 
GGTGAAAAAA (2290) 
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1 . . GGAGAGTC TGACCACCAT GCCACCTCCT CGCCTCCTCT TCTTCCTCCT 
51 CHCCTCACC CCCATGGAAG TCAGGCCCGA GGAACCTCTA GTGGTGAAGG 
101 TGGAAGAGGG AGATAACGCT GTGCTGCAGT GCCTCAAGGG GACCTCAGAT 
151 GGCCCCACTC AGCAGCTGAC CTGGTCTCGG GAGTCCCCGC TTAAACCCTT 
201 CTTAAAACTC AGCCTGGGGC TGCCAGGCCT GGGAATCCAC ATGAGGCCCC 
251 TGGcCATCTG GCTTTTCATC TTCAACGTCT CTCAACAGAT GGGGGGCTTC 
301 TACCTGTGCC AGCCGGGGCC CCCCTCTGAG AAGGCCTGGC AGCCTGGCTG 
351 GACAGTCAAT GTGGAGGGCA GCGGGGAGCT GTTCCGGTGG AATGTTTCGG 
401 ACCTAGGTGG CCTGGGCTGT GGCCTGAAGA ACAGGTCCTC AGAGGGCCCC 
451 AGCTCCCCTT CCGGGAAGCT CATGAGCCCC AAGCTGTATG TGTGGGCCAA 
501 AGACCGCCCT GAGATCTGGG AGGGAGAGCC TCCGTGTGTC CCACCGAGGG 
551 ACAGCCTGAA CCAGAGCCTC AGCCAGGACC TCACCATGGC CCCTGGCTCC 
601 ACACTCTGGC TGTCCTGTGG GGTACCCCCT GACTCTGTGT CCAGGGGCCC 
651 CCTCTCCTGG ACCCATGTGC ACCCCAAGGG GCCTAAGTCA TTGCTGAGCC 
701 TAGAGCTGAA GGACGATCGC CCGGCCAGAG ATATGTGGGT AATGGAGACG 
751 GGTCTGTTGT TGCCCCGGGC CACAGCTCAA GACGCTGGAA AGTATTATTG 
801 TCACCGTGGC AACCTGACCA TGTCATTCCA CCTGGAGATC ACTGCTCGGC 
851 CAGTACTATG GCACTGGCTG CTGAGGACTG GTGGCTGGAA GGTCTCAGCT 
901 GTGACTTTGG CTTATCTGAT CTTCTGCCTG TGTTCCCTTG TGGGCATTCT . 
951 TCATCTTCAA AGAGCCCTGG TCCTGAGGAG GAAAAGAAAG CGAATGACTG 
1001 ACCCCACCAG GAGATTCTTC AAAGTGACGC CTCCCCCAGG AAGCGGGCCC 
1051 CAGAACCAGT ACGGGAACGT GCTGTCTCTC CCCACACCCA CCTCAGGCCT 
1101 CGGACGCGCC CAGCGTTGGG CCGCAGGCCT GGGGGGCACT GCCCCGTCTT 
1151 ATGGAAACCC GAGCAGCGAC GTCCAGGCGG ATGGAGCCTT GGGGTCCCGG 

FIG. 12. 



lyUSTOTUTE SHEET 



WO 89/08114 



PCT/US88/02809 



1201 AGCCGCCGGG AGTGGGCCCA GAAGAAGAGG AAGGGGAGGG CTATGAGGAA 

1251 CCTGACAGTG AGGAGGACTC CGAGTTCTAT GAGAACGACT CCAACCTTGG 

1301 GCAGGACCAG CTCTCCCAGG ATGGCAGCGG CTACGAGAAC CCTGAGGATG 

1351 AGCCCCTGGG TCCTGAGGAT GAAGACTCCT TCTCCAACGC TGAGTCTTAT 

1401 GAGAACGAGG ATGAAGAGCT GACCCAGCCG GTCGCCAGGA CAATGGACTT 

1451 CCTGAGCCCT CATGGGTCAG CCTGGGACCC CAGCCGGGAA GCAACCTCCC 

1501 TGGGGTCCCA GTCCTATGAG GATATGAGAG GAATCCTGTA TGCAGCCCCC 

1551 CAGCTCCGCT CCATTCGGGG CCAGCCTGGA CCCAATCATG AGGAAGATGC 

1601 AGACTCTTAT GAGAACATGG ATAATCCCGA TGGGCCAGAC CCAGCCTGGG 

1651 GAGGAGGGGG CCGCATGGGC ACCTGGAGCA CCAGGTGATC CTCAGGTGGC 

1701 . CAGCCTGGAT CTCCTCAAGT CCCCAAGATT CACACCTGAC TCTGAAATCT 

1751 GAAGACCTCG AGCAGATGAT GCCAACCTCT GGAGCAATGT TGCTTAGGAT 

1801 GTGTGCATGT GTGTAAGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 

1851 ATACATGCCA GTGACACTTC CAGTCCCCTT TGTATTCCTT AAATAAACTC 

1901 AATGAGCTCT TCCAAAAAAA AAAA 

FIG. 12. (cont.) 



ilJBSTSTUTg SHEiT 



WO 89/08114 PCT/US88/02809 



1 ACAAAGACAA ACTGCACCCA CTGAACTCCG CAGCTAGCAT CCAAATCAGC 

51 CCTTGAGATT TGAGGCCTTG GAGACTCAGG AGTTTTGAGA GCAAAATGAC 

101 AACACCCAGA AATTCAGTAA ATGGGACTTT CCCGGCAGAG CCAATGAAAG 

151 GCCCTATTGC TATGCAATCT GGTCCAAAAC CACTCTTCAG GAGGATGTCT 

201 TCACTGGTGG GCCCCACGCA MGCTTC7TC ATGAGGGAAT CTAAGACTTT 

251 GGGGGCTGTC CAGATTATGA ATGGGCTCTT CCACATTGCC CTGGGGGGTC 

301 TTCTGATGAT CCCAGCAGGG ATCTATGCAC CCATCTGTGT GACTGTGTGG 

351 TACCCTCTCT GGGGAGGCAT TATGTATATT ATTTCCGGAT CACTCCTGGC 

401 AGCAACGGAG AAAAACTCCA GGAAGTGTTT GGTCAAAGGA AAAATGATAA 

451 TGAATTCATT GAGCCTCTTT GCTGCCATTT CTGGAATGAT TCTTTCAATC 

501 ATGGACATAC TTAATATTAA AATTTCCCAT TTTTTAAAAA TGGAGAGTCT 

551 GAATTTTATT AGAGCTCACA CACCATATAT TAACATATAC AACTGTGAAC 

601 CAGCTAATCC CTCTGAGAAA AACTCCCCAT CTACCCAATA CTGTTACAGC 

651 ATACAATCTC TGTTCTTGGG CATTTTGTCA GTGATGCTGA TCTTTGCCTT 

701 CTTCCAGGAA CTTGTAATAG CTGGCATCGT TGAGAATGAA TGGAAAAGAA 

751 CGTGCTCCAG ACCCAAATCT AACATAGTTC TCCTGTCAGC ACAAGAAAAA 

801 AAAGAACAGA CTATTGAAAT AAAAGAAGAA GTGGTTGGGC TAACTGAAAC 

851 ATCTTCCCAA CCAAAGAATG AAGAAGACAT TGAAATTATT CCAATCCAAG 

901 AAGAGGAAGA AGAAGAAACA GAGACGAACT TTCCAGAACC TCCCCAAGAT 

951 CAGGAATCCT CACCAATAGA AAATGACAGC TCTCCTTAAG TGATTTCTTC 

1001 TGTTfTCTGT TTCCII I II I AAACATTAGT GTTCATAGCT TCCAAGAGAC 

1051 ATGCTGACTT TCATTTCTTG AGGTACTCTG CACATACGCA CCACATCTCT 



FIG. 13 
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"/to 

1101 ATCTGGCCTT TGCATGGAGT GACCATAGCT CCTTCTCTCT TACATTGAAT 

1151 GTAGAGAATG TAGCCATTGT AGCAGCTTGT GTTGTCACGC TTCTTC"nTT 

1201 GAGCAACTTT CTTACACTGA AGAAAGGCAG AATGAGTGCT TCAGAATGTG 

1251 ATTTCCTACT AACCTGTTCC TTGGATAGGC TTTTTAGTAT AGTA I 1 1 1 1 1 

1301 TTTGTCATTT TCTCCATCAG CAACCAGGGA GACTGCACCT GATGGAAAAG 

1351 ATATATGACT GCTTCATGAC ATTCCTAAAC TATCTTTTTT TTATTCCACA 

1401 TCTACGTTTT TGGTGGAGTC CCTTTTTATC ATCCTTAAAA CAATGATGCA 

1451 AAAGGGCTTT AGAGCACAAT GGATCT 
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CCAGACACGA CACCATGCAT CTCCTCGGCC 
CTAGAATACT TGGCTTTCTC TGACTCAAGT 
TGAAACCCTC TACGCCTGGG AGGGGGCCTG 
ACAGAGCCCT AGATGGTGAC CTGGAAAGCT 
GAGTATAACA AGAACACCTC GAAGTTTGAT 
CACAAAGGAT GGGAAGGTTC CTTCTGAGCA 
GAGACAAGAA TAAGAACTGC ACACTGAGTA 
GACAGTGGTC AGCTGGGGCT GAGGATGGAG 
GGAACGAATA CACCTCAATG TCTCTGAAAG 
AGCTCCCTCC AGAAATTCAA GAGTCCCAGG 
CTGAATTTCT CCTGCTATGG GTATCCGATC 
GGGGGTTCCA ATGAGGCAGG CTGCTGTCAC 
AGTCTGTCTT CACCCGGAGC GAGCTCAAGT 
CATGGGAAGA TTGTGACCTG CCAGCTTCAG 
CTCCAATGAC ACGGTGCAGc TgAACGTGAA 
CCACAGTGAT TCAAAACCCC ATGCCGATTC 
CTTTCCTGTA ACTACAATTC CAGTAACCCC 
GAAACCCCAT GGCGCCTGGG AGGAGCCATC 
AAAACGTTGG CTGGGACAAC ACAACCATCG 
TGGTGCTCGT GGGCCTCCCC TGTCGCCCTG 
AGACGTGAGG GTCCGGAAAA TCAAGCCCCT 
ACTCGGTCAG CCTCCAATGT GACTTCTCAA 
CAGTTCTTCT GGGAGAAAAA TGGCAGGCTT 
GAATTTTGAC TCCATCTCCC CAGAAGATGC 
TGAACAACTC CATAGGACAG ACAGCGTCCA 
CTGTATGCAC CCAGGAGGCT GCGTGTGTCC 
GATGGAGGGG AAGAGTGCAA CCCTGACCTG 
CCGTCTCCCA CTACACCTGG TTTGACTGGA 
CACAGCCAGA AGCTGAGATt GGAGCCGGTG 
CTACTGGTGC CAGGGGACCA ACAGTGTGGG 
GCACCCTCAC CGTCTACTAT AGCCCGGAGA 
GTGGGACTCG GGTCCTGCCT CGCCATCCTC 
CAAGCTCCAG CGACGTTGGA AGAGGACACA 
GAGAATTCCA GCGGCCAGAG CTTCTTTGTG 
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1701 AGGAATAAM AGGTTAGAAG GGCCCCCCTC TCTGAAGGCC CCCACTCCCT 

1751 GGGATGCTAC AATCCAATGA TGGAAGATcG CATTAGCTAC ACCACCCTGC 

1801 GCTTTCcCGA GATGAACATA CCACGAACTG GAGATGCAGA GTCCTCAGAG 

1851 ATGCAGAGAC CTCCCCCGGA CTGCGATGAC ACGGTCACTT ATTCAGCATT 

1901 GCACAAGCGC CAAGTGGGCA CTATGAGAAC GTCATTCCAG ATTTTCCAGA 

1951 AGATGAGGGG ATTCATTACT CAGAGCTGAT CCAGTTTGGG GTCGGGGAGC 

2001 GGCCTCAGGC ACAAGAAAAT GTGGACTATG TGATCCTCAA ACATTGACAT 

2051 GGATGGGCTG CAGcAGAGGC ACTGGGGGCA GCGGGGGCCA GGGAAGTcCC 

2101 CGAGTTT 
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FIG. 15. 

U8ST1T0TS SHEET 
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PCT/US88/02809 



1701 CCCCAATGTC CAGCCTCTTT AACATCTTCT TTCCTATGCC CTCTCTGTGG 

1751 ATCCCTACTG GTGGTTTCTG CCTTCTCCAT GCTGAGAACA AAATCACCTA 

1801 TTCACTGCTT ATGCAGTCGG AAGCTCCAGA AGMCAAAGA GCCCAATTAC 

1851 CAGAACCACA TTAAGTCTCC ATTGTTTTGC CTTGGGATTT GAGAAGAGM 

1901 TTAGAGAGGT GAGGATCTGG TATTTCCTGG ACTAAATTCC CCTTGGGGAA 

1951 GACGAAGGGA TGCTGCAGTT CCAAAAGAGA AGGACTCTTC CAGAGTCATC 

2001 TACCTGAGTC CCAAAGCTCC CTGTCCTGAA AGCCACAGAC AATATGGTCC 

2051 CAAATGACTG ACTGCACCTT CTGTGCCTCA GCCGTTCTTG ACATCAAGAA 

2101 TCTTCTGTTC CACATCCACA CAGCCAATAC AATTAGTCAA ACCACTGTTA 

2151 TTAACAGATG TAGCAACATG AGAAACGCTT ATGTTACAGG TTACATGAGA 

2201 GCAATCATGT AAGTCTATAT GACTTCAGAA" ATGTTAAAAT AGACTAACCT 

2251 CTAACAACAA ATTAAAAGTG ATTGTTTCAA GGTGAAAAAA 
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FIG. 16. 
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1701 GGGTCTCAGA GCTGAGCCAA GACCTCCCGG GTCCTCTGCG Gl I 1 1 1 IGTG 

1751 TCTTTCAGAG TGGCTGGTGC TCCAGACCCC TCACCTGGAG TTCCAGGAGG 

1801 GAGAAACCAT CGTGCTGAGG TGCCACAGCT GGAAGGACAA GCCTCTGGTC 

1851 MGGTCACAT TCTTCCAGAA TGGAAAATCC AAGAAATTTT CCCGTTCGGA 

1901 TCCCAACnC TCCATCCCAC AAGCAAACCA CAGTCACAGT GGTGATTACC 

1951 ACTGCACAGG AAACATAGGC TACACGCTGT ACTCATCCAA GCCTGTGACC 

2001 ATCACTGTCC AAGCTCCCAG CTCTTCACCG ATGGGGATCA TTGTGGCTGT 

2051 GGTCACTGGG ATTGCTGTAG CGGCCATTGT TGCTGCTGTA GTGGCCTTGA 

2101 TCTACTGCAG GAAAAAGCGG ATTTCAGGTT TGTAGCTCCT CCCGGTCCCT 

2151 TTTGTTATCA GTTTCCACTT T 
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